(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one 
aspect of the present invention relates to diagnostic assays for determining INTERCEPT 
340, MANGO 003, MANGO 347, TANGO 272, TANGO 295, TANGO 354, or TANGO 
378 protein and/or nucleic acid expression as well as INTERCEPT 340, MANGO 003, 
MANGO 347, TANGO 272, TANGO 295, TANGO 354, or TANGO 378 activity, in the 

^ context of a biological sample {e.g., blood, serum, cells, tissue) to thereby determine 
whether an individual is afflicted with a disease or disorder, or is at risk of developing a 
disorder, associated with aberrant or unwanted INTERCEPT 340, MANGO 003, MANGO 
347, TANGO 272, TANGO 295, TANGO 354, or TANGO 378 gene expression or activity. 
The invention also provides for prognostic (or predictive) assays for determining whether 

^ an individual is at risk of developing a disorder associated with INTERCEPT 340, 

MANGO 003, MANGO 347, TANGO 272, TANGO 295, TANGO 354, or TANGO 378 
protein or nucleic acid expression or activity. For example, mutations in a gene can be 
assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to thereby prophylactically treat an individual prior to the onset of a disorder 

* ^ characterized by or associated with protein or nucleic acid expression or activity. 

As an alternative to making determinations based on the absolute expression level of 
selected genes, determinations may be based on the normalized expression levels of these 
genes. Expression levels are normalized by correcting the absolute expression level of a 
INTERCEPT 340, MANGO 003, MANGO 347, TANGO 272, TANGO 295, TANGO 354, 

20 

or TANGO 378 gene by comparing its expression to the expression of a gene that is not a 
INTERCEPT 340, MANGO 003, MANGO 347, TANGO 272, TANGO 295, TANGO 354, 
or TANGO 378, e.g., a housekeeping gene that is constitutively expressed, Suitable genes 
for normalization include housekeeping genes such as the actin gene. This normalization 
allows the comparison of the expression level in one sample, e.g., a patient sample, to 
another sample, e.g., a non-disease sample, or between samples from different sources. 

Alternatively, the expression level can be provided as a relative expression level To 
determine a relative expression level of a gene, the level of expression of the gene is 
determined for 10 or more samples of different cell isolates, preferably 50 or more samples, 
prior to the determination of the expression level for the sample in question. The mean 
JU expression level of each of the genes assayed in the larger number of samples is determined 
and this is used as a baseline expression level for the gene(s) in question. The expression 
level of the gene determined for the test sample (absolute level of expression) is then 
divided by the mean expression value obtained for that gene. This provides a relative 
expression level and aids in identifying extreme cases of disease. 

Preferably, the samples used in the baseline determination will be from diseased or 
from non-diseased cells of tissue. The choice of the cell source is dependent on the use of 
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the relative expression level. Using expression found in normal tissues as a mean 
expression score aids in validating whether the INTERCEPT 340, MANGO 003, MANGO 
347, TANGO 272, TANGO 295, TANGO 354, or TANGO 378 gene assayed is diseased 
cell-type specific (versus normal cells). Such a use is particularly important in identifying 
whether a INTERCEPT 340, MANGO 003, MANGO 347, TANGO 272, TANGO 295, 
TANGO 354, or TANGO 378 gene can serve as a target gene. In addition, as more data is 
accumulated, the mean expression value can be revised, providing improved relative 
expression values based on accumulated data. Expression data from cells provide a means 
for grading the severity of the disease state. 

Another aspect of the invention pertains to monitoring the influence of agents (e.g. , 
drugs, compounds) on the expression or activity of INTERCEPT 340, MANGO 003, 
MANGO 347, TANGO 272, TANGO 295, TANGO 354, or TANGO 378 genes in clinical 
trials. 

These and other agents are described in further detail in the following sections. 



1* Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid of the invention in a biological sample involves obtaining a biological sample 
from a test subject and contacting the biological sample with a compound or an agent 
capable of detecting a polypeptide or nucleic acid {e.g., mRNA, genomic DNA) of the 

20 

invention such that the presence of a polypeptide or nucleic acid of the invention is detected 
in the biological sample. A preferred agent for detecting mRNA or genomic DNA 
encoding a polypeptide of the invention is a labeled nucleic acid probe capable of 
hybridizing to mRNA or genomic DNA encoding a polypeptide of the invention. The 
nucleic acid probe can be, for example, a full-length cDNA, such as the nucleic acid of SEQ 
25 IDNOsrl, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 24, 25, 27, 28 or 30, or aportion 
thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in 
length and sufficient to specifically hybridize under stringent conditions to a mRNA or 
genomic DNA encoding a polypeptide of the invention. Other suitable probes for use in the 
diagnostic assays of the invention are described herein. 

on 

A preferred agent for detecting a polypeptide of the invention is an antibody capable 
of binding to a polypeptide of the invention, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a 
fragment thereof {e.g. , Fab or F(ab%) can be used. The term "labeled", with regard to the 
probe or antibody, is intended to encompass direct labeling of the probe or antibody by 
35 coupling (i.e., physically linking) a detectable substance to the probe or antibody, as well as 
indirect labeling of the probe or antibody by reactivity with another reagent that is directly 
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labeled. Examples of indirect labeling include detection of a primary antibody using a 
fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such 
that it can be detected with fluorescently labeled streptavidin. The term "biological sample" 
is intended to include tissues, cells and biological fluids isolated from a subject, as well as 
tissues, cells and fluids present within a subject. That is, the detection method of the 
invention can be used to detect mRNA, protein, or genomic DNA in a biological sample in 
vitro as well as in vivo. For example, in vitro techniques for detection of mRNA include 
Northern hybridizations and in situ hybridizations. In vitro techniques for detection of a 
polypeptide of the invention include enzyme linked immunosorbent assays (ELISAs), 
Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for 
detection of genomic DNA include Southern hybridizations. Furthennore, in vivo 
techniques for detection of a polypeptide of the invention include introducing into a subject 
a labeled antibody directed against the polypeptide. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be detected 
by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is 
a peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent . 
capable of detecting a polypeptide of the invention or mRNA or genomic DNA encoding a 
polypeptide of the invention, such that the presence of the polypeptide or mRNA or 
genomic DNA encoding the polypeptide is detected in the biological sample, and 
comparing the presence of the polypeptide or mRNA or genomic DNA encoding the 
polypeptide in the control sample with the presence of the polypeptide or mRNA or 
genomic DNA encoding the polypeptide in the test sample. 

the invention also encompasses kits for detecting the presence of a polypeptide or 
nucleic acid of the invention in a biological sample (a test sample). Such kits can be used to 
determine if a subject is suffering from or is at increased risk of developing a disorder 
associated with aberrant expression of a polypeptide of the invention (e.g., a proliferative 
disorder, e.g., psoriasis or cancer). For example, the kit can comprise a labeled compound 
or agent capable of detecting the polypeptide or mRNA encoding the polypeptide in a 
biological sample and means for determining the amount of the polypeptide or mRNA in 
the sample (e.g., an antibody which binds the polypeptide or an oligonucleotide probe 
which binds to DNA or mRNA encoding the polypeptide). Kits can also include 
instructions for observing that the tested subject is suffering from or is at risk of developing 
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a disorder associated with aberrant expression of the polypeptide if the amount of the 
polypeptide or mRNA encoding the polypeptide is above or below a normal level. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody (e.g., 
attached to a solid support) which binds to a polypeptide of the invention; and, optionally, 
(2) a second, different antibody which binds to either the polypeptide or the first antibody 

^ and is conjugated to a detectable agent. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide of the invention or (2) a pair of primers useful for 
amplifying a nucleic acid molecule encoding a polypeptide of the invention, The kit can 

^ also comprise, e.g., a buffering agent, a preservative, or a protein stabilizing agent. The kit 
can also comprise components necessary for detecting the detectable agent (e.g., an enzyme 
or a substrate). The kit can also contain a control sample or a series of control samples 
which can be assayed and compared to the test sample contained. Each component of the 
kit is usually enclosed within an individual container and all of the various containers are 

1 5 within a single package along with instructions for observing whether the tested subject is 
suffering from or is at risk of developing a disorder associated with aberrant expression of 
the polypeptide. 

2. Prognostic Assays 
2 ^ The methods described herein can furthermore be utilized as diagnostic or 

prognostic assays to identify subjects having or at risk of developing a disease or disorder 
associated with aberrant expression or activity of a polypeptide of the invention. For 
example, the assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 

or 

disorder associated with aberrant expression or activity of a polypeptide of the invention. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for 
developing such a disease or disorder. Thus, the present invention provides a method in 
which a test sample is obtained from a subject and a polypeptide or nucleic acid (e.g., 
mRNA, genomic DNA) of the invention is detected, wherein the presence of the 
polypeptide or nucleic acid is diagnostic for a subject having or at risk of developing a 
disease or disorder associated with aberrant expres sion or activity of the polypeptide. As 
used herein, a "test sample" refers to a biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid (e.g., serum), cell sample, or tissue. 
Furthermore, the prognostic assays described herein can be used to determine 

if 

whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drag candidate) to 
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treat a disease or disorder associated with aberrant expression or activity of a polypeptide of 
the invention. For example, such methods can be used to determine whether a subject can 
be effectively treated with a specific agent or class of agents (e.g., agents of a type which 
decrease activity of the polypeptide). Thus, the present invention provides methods for 
determining whether a subject can be effectively treated with an agent for a disorder 

5 associated with aberrant expression or activity of a polypeptide of the invention in which a 
test sample is obtained and the polypeptide or nucleic acid encoding the polypeptide is 
detected (e.g., wherein the presence of the polypeptide or nucleic acid is diagnostic for a 
subject that can be administered the agent to treat a disorder associated with aberrant 
expression or activity of the polypeptide)* 

* ® The methods of the invention can also be used to detect genetic lesions or mutations 

in a gene of the invention, thereby determining if a subject with the lesioned gene is at risk 
for a disorder characterized aberrant expression or activity of a polypeptide of the invention. 
In preferred embodiments, the methods include detecting, in a sample of cells from the 
subject, the presence or absence of a genetic lesion or mutation characterized by at least one 

15 of an alteration affecting the integrity of a gene encoding the polypeptide of the invention, 
or the mis-expression of the gene encoding the polypeptide of the invention. For example, 
such genetic lesions or mutations can be detected by ascertaining the existence of at least 
one of: 1) a deletion of one or more nucleotides from the gene; 2) an addition of one or 
more nucleotides to the gene; 3) a substitution of one or more nucleotides of the gene; 4) a 
chromosomal rearrangement of the gene; 5) an alteration in the level of a messenger RNA 
transcript of the gene; 6) an aberrant modification of the gene, such as of the methylation 
pattern of the genomic DNA; 7) the presence of a non-wild type splicing pattern of a 
messenger RNA transcript of the gene; 8) a non-wild type level of a the protein encoded by 
the gene; 9) an allelic loss of the gene; and 10) an inappropriate post-translational 

^ modification of the protein encoded by the gene. As described herein, there are a large 
number of assay techniques known in the art which can be used for detecting lesions in a 
gene. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in 
a polymerase chain reaction (PCR) (see, e.g.., U.S. Patent NOs. 4,683,195 and 4,683,202), 

OA 

such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) 
(see, e.g., Landegran et al., 1988, Science 241:1077-80; and Nakazawa et al., 1994, Proc. 
Natl Acad. Set USA 91 :360-4), the latter of which can be particularly useful for detecting 
point mutations in a gene (see, e.g., Abravaya et al, 1995, Nucleic Acids Res. 23:675-82). 
This method can include the steps of collecting a sample of cells from a patient, isolating 
35 nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the 
nucleic acid sample with one or more primers which specifically hybridize to the selected 
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gene under conditions such that hybridization and amplification of the gene (if present) 
occurs, and detecting the presence or absence of an amplification product, or detecting the 
size of the amplification product and comparing the length to a control sample. It is 
anticipated that PCR and/or LCR may be desirable to use as a preliminary amplification 
step in conjunction with any of the techniques used for detecting mutations described 
5 herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et aL, 1990, Proc. Natl Acad. Set USA 87:1874-78), transcriptional amplification 
system (Kwoh, et al, 1989, Proc. Natl. Acad. Set USA 86:1 173-7), Q-Beta Replicase 
(Lizardi et al., 1988, Bio/Technology 6:1 197), or any other nucleic acid amplification 
^ method, followed by the detection of the amplified molecules using techniques well known 
to those of skill in the art. These detection schemes are especially useful for the detection 
of nucleic acid molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in a selected gene from a sample cell can 
be identified by alterations in restriction enzyme cleavage patterns. For example, sample 
^ and control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and 
compared. Differences in fragment length sizes between sample and control DNA indicates 
mutations in the sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g., 
U.S. Patent No. 5,498,531) can be used to score for the presence of specific mutations by 

90 

development or loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations can be identified by hybridizing a sample 
and control nucleic acids, e.g> DNA or RNA, to high density arrays containing hundreds or 
thousands of oligonucleotides probes (Cronin et al., 1996, Human Mutation 7:244-55; 
Kozal et al., 1996, Nature Medicine 2:753-9). For example, genetic mutations can be 
2 ^ identified in two-dimensional arrays containing light-generated DNA probes as described in 
Cronin et ah, supra. Briefly, a first hybridization array of probes can be used to scan 
through long stretches of DNA in a sample and control to identify base changes between the 
sequences by making linear arrays of sequential overlapping probes. This step allows the 
identification of point mutations. This step is followed by a second hybridization array that 

30 

allows the characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other complementary 
to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the 
^ art can be used to directly sequence the selected gene and detect mutations by comparing 
the sequence of the sample nucleic acids with the corresponding wild-type (control) 
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sequence. Examples of sequencing reactions include those based on techniques developed 
by Maxim and Gilbert (1977, Proc. Natl. Acad. Set USA 74:560) or Sanger (1977, Proc. 
Natl. Acad. Set USA 74:5463). It is also contemplated that any of a variety of automated 
sequencing procedures can be utilized when performing the diagnostic assays developed by 
Naeve et al. (1995, Bio/Techniques 19:448-53), including sequencing by mass spectrometry 

5 (see, e.g., PCX Publication No. WO 94/16101; Cohen et al., 1996, Adv. Chromatogr. 
36:127-62; and Griffin et al, 1993, Appl Biochem. Biotechnol. 38:147-59). 

Other methods for detecting mutations in a selected gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et at, 1985, Science 230:1242). In general, the 

^ technique of mismatch cleavage entails providing heteroduplexes formed by hybridizing 
(labeled) RNA or DNA containing the wild-type sequence with potentially mutant RNA or 
DNA obtained from a tissue sample. The double-stranded duplexes are treated with an 
agent which cleaves single-stranded regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample strands. RNA/DNA duplexes can be 

1 5 treated with RNase to digest mismatched regions, and DNA/DNA hybrids can be treated 
with S 1 nuclease to digest mismatched regions. 

In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then separated 

^ by size on denaturing polyacrylamide gels to determine the site of mutation. See, e.g. , 
Cotton et al., 1988, Proc. Natl. Acad. Set USA 85:4397; Saleeba et al., 1992, Methods 
Enzymol. 217:286-95. In a preferred embodiment, the control DNA or RNA can be labeled 
for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
25 proteins that recognize mismatched base pairs in double-stranded DNA (so called DNA 
mismatch repair enzymes) in defined systems for detecting and mapping point mutations in 
cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli cleaves 
A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches (Hsu et al., 1994, Carcinogenesis 15:1657-62). According to an 
DKJ exemplary embodiment, a probe based on a selected sequence, e.g., a wild-type sequence, is 
hybridized to a cDNA or other DNA product from a test cell(s). The duplex is treated with 
a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected from 
electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to identify 
35 mutations in genes. For example, single strand conformation polymorphism (SSCP) may 
be used to detect differences in electrophoretic mobility between mutant and wild type 
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nucleic acids (Orita et al,, 1989, Proc. Natl Acad. Set USA 86:2766; see also Cotton, 1993, 
Mutat Res. 285:125-44; Hayashi, 1992, Genet Anal Tech. Appl 9:73-9). Single-stranded 
DNA fragments of sample and control nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, and the resulting alteration in electrophoretic mobility enables the detection of 
even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in 
which the secondary structure is more sensitive to a change in sequence. In a preferred 
embodiment, the subject method utilizes heteroduplex analysis to separate double stranded 
heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et al., 
1991, Trends Genet. 7:5). 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al., 1985, Nature 313:495). When DGGE is 
used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for example by adding a 'GC clamp of approximately 40 bp of high-melting GC- 
rich DNA by PCR. In a further embodiment, a temperature gradient is used in place of a 
denaturing gradient to identify differences in the mobility of control and sample DNA 
(Rosenbaum and Reissner, 1987, Biophys* Chem. 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al., 1986, Nature 
324:163; Saild et al., 1989, Proc. Natl Acad. Set USA 86:6230). Such allele specific 
oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. Oligonucleotides 
used as primers for specific amplification may carry the mutation of interest in the center of 
the molecule (so that amplification depends on differential hybridization; Gibbs et al., 1989, 
Nucleic Acids Res. 17:2437-48) or at the extreme 3 1 end of one primer where, under 
appropriate conditions, mismatch can prevent or reduce polymerase extension (Prossner, 
1993, Tibtech 1 1 :238). In addition, it may be desirable to introduce a novel restriction site 
in the region of the mutation to create cleavage-based detection (Gasparini et al., 1992, Mol 
Cell Probes 6:1). It is anticipated that in certain embodiments amplification may also be 
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performed using Taq ligase for amplification (Barany, 1991, Proc. Natl Acad, Scl USA 
88: 1 89). In such cases, ligation will occur only if there is a perfect match at the 3' end of 
the 5* sequence making it possible to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving a gene 
encoding a polypeptide of the invention. Furthermore, any cell type or tissue, preferably 
peripheral blood leukocytes, in which the polypeptide of the invention is expressed may be 

* ® utilized in the prognostic assays described herein. 

3. Pharmacogenomics 

Agents, or modulators which have a stimulatory or inhibitory effect on activity or 
expression of a polypeptide of the invention as identified by a screening assay described 

* ^ herein can be administered to individuals to treat (prophylactically or therapeutically) 

disorders associated with aberrant activity of the polypeptide. In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's 
genotype and that individual's response to a foreign compound or drug) of the individual 
may be considered. Differences in metabolism of therapeutics can lead to severe toxicity or 

10 

therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the 
selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on 
a consideration of the individual's genotype. Such pharmacogenomics can further be used 
to determine appropriate dosages and therapeutic regimens. Accordingly, the activity of a 

^ polypeptide of the invention, expression of a nucleic acid of the invention, or mutation 
content of a gene of the invention in an individual can be determined to thereby select 
appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. 

30 See, e.g., Linder, 1997, Clin. Chem. 43(2):254-66. In general, two types of 

pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body are referred to as "altered drug action." 
Genetic conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 

35 either as rare defects or as polymorphisms. For example, glucose-6-phosphate 

dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in which the main 
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clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g. 9 N-acetyltransferase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response 
and serious toxicity after taking the standard and safe dose of a drug. These polymorphisms 
are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
metabolizer (PM). The prevalence of PM is different among different populations. For 

^ example, the gene coding for CYP2D6 is highly polymorphic and several mutations have 
been identified in PM, which all lead to the absence of functional CYP2D6. Poor 
metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug 
response and side effects when they receive standard doses. If a metabolite is the active 
therapeutic moiety, a PM will show no therapeutic response, as demonstrated for the 

^ analgesic effect of codeine mediated by its CYP2D6-formed metabolite morphine. The 
other extreme are the so called ultra-rapid metabolizers who do not respond to standard 
doses. Recently, the molecular basis of ultra-rapid metabolism has been identified to be due 
to C YP2D6 gene amplification. 

Thus, the activity of a polypeptide of the invention, expression of a nucleic acid 

20 

encoding the polypeptide, or mutation content of a gene encoding the polypeptide in an 
individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used 
to apply genotyping of polymoiphic alleles encoding drug-metabolizing enzymes to the 

identification of an individual's drug responsiveness phenotype. This knowledge, when 

2S ♦ * 

applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and 

thus enhance therapeutic or prophylactic efficiency when treating a subject with a 

modulator of activity or expression of the polypeptide, such as a modulator identified by 

one of the exemplary screening assays described herein. 

OA 

4. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents {e.g., drugs, compounds) on the expression or 
activity of a polypeptide of the invention (e.g. P the ability to modulate aberrant cell 
proliferation chemotaxis, and/or differentiation) can be applied not only in basic drug 
screening, but also in clinical trials. For example, the effectiveness of an agent, as 
determined by a screening assay as described herein, to increase gene expression, protein 
levels or protein activity, can be monitored in clinical trials of subjects exhibiting decreased 
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gene expression, protein levels, or protein activity. Alternatively, the effectiveness of an 
agent, as determined by a screening assay, to decrease gene expression, protein levels or 
protein activity, can be monitored in clinical trials of subjects exhibiting increased gene 
expression, protein levels, or protein activity. In such clinical trials, expression or activity 
of a polypeptide of the invention and preferably, that of other polypeptide that have been 

^ implicated in for example, a cellular proliferation disorder, can be used as a marker of the 
immune responsiveness of a particular cell. 

For example, and not by way of limitation, genes, including those of the invention, 
that are modulated in cells by treatment with an agent (e.g., compound, drug or small 
molecule) which modulates activity or expression of a polypeptide of the invention (e.g., as 

^ identified in a screening assay described herein) can be identified. Thus, to study the effect 
of agents on cellular proliferation disorders, for example, in a clinical trial, cells can be 
isolated and RNA prepared and analyzed for the levels of expression of a gene of the 
invention and other genes implicated in the disorder. The levels of gene expression (i.e., a 
gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 

15 described herein, or alternatively by measuring the amount of protein produced, by one of 
the methods as described herein, or by measuring the levels of activity of a gene of the 
invention or other genes. In this way, the gene expression pattern can serve as a marker, 
indicative of the physiological response of the cells to the agent. Accordingly, this response 
state may be determined before, and at various points during, treatment of the individual 

20 with the agent. 

In a preferred embodiment, the present invention provides a method for monitoring 
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) comprising the steps of (i) obtaining a 

25 pre-adrninistration sample from a subject prior to administration of the agent; (ii) detecting 
the level of the polypeptide or nucleic acid of the invention in the preadministration sample; 
(iii) obtaining one or more post-administration samples from the subject; (iv) detecting the 
level the of the polypeptide or nucleic acid of the invention in the post-administration 
samples; (v) comparing the level of the polypeptide or nucleic acid of the invention in the 

^ pre-administration sample with the level of the polypeptide or nucleic acid of the invention 
in the post-administration sample or samples; and (vi) altering the administration of the 
agent to the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of the polypeptide to higher levels than 
detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased 

35 administration of the agent may be desirable to decrease expression or activity of the 
polypeptide to lower levels than detected, i.e., to decrease the effectiveness of the agent. 
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C. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant expression or activity of a polypeptide of the invention, e.g., cardiac infection 
(e.g., myocarditis or dilated cardiomyopathy), central nervous system infection (e.g., non- 
specific febrile illness or meningoencephalitis), pancreatic infection (e.g., acute 
pancreatitis), respiratory infection (pneumonia), gastrointestinal infection, type I diabetes, 
cancer, familia hypercholesterolemia, treat hemophilia B, Marfan syndrome, protein S 
deficiency, allergy, inflammation, and gastroduodenal ulcer. Moreover, the polypeptides of 
the invention can be used to modulate cellular function, survival, morphology, proliferation 
and/or differentiation, 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a disease 
or condition associated with an aberrant expression or activity of a polypeptide of the 
invention, by administering to the subject an agent which modulates expression or at least 
one activity of the polypeptide. Subjects at risk for a disease which is caused or contributed 
to by aberrant expression or activity of a polypeptide of the invention can be identified by, 
for example, any or a combination of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the aberrancy, such that a disease or disorder is prevented or, alternatively, 
delayed in its progression. Depending on the type of aberrancy, for example, an agonist or 
antagonist agent can be used for treating the subject. 



2* Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating expression or 
activity of a polypeptide of the invention for therapeutic purposes. The modulatory method 
of the invention involves contacting a cell with an agent that modulates one or more of the 
activities of the polypeptide. An agent that modulates activity can be an agent as described 
herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of the 
polypeptide, a peptide, a peptidomimetic, or other small molecule. In one embodiment, the 
agent stimulates one or more of the biological activities of the polypeptide. Examples of 
such stimulatory agents include the active polypeptide of the invention and a nucleic acid 
molecule encoding the polypeptide of the invention that has been introduced into the cell. 
In another embodiment, the agent inhibits one or more of the biological activities of the 
polypeptide of the invention. Examples of such inhibitory agents include antisense nucleic 
acid molecules and antibodies. These modulatory methods can be performed in vitro (e.g>, 
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by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a polypeptide of the invention. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described herein), or 

5 combination of agents that modulates (e.g., upregulates or downregulates) expression or 
activity. In another embodiment, the method involves administering a polypeptide of the 
invention or a nucleic acid molecule of the invention as therapy to compensate for reduced 
or aberrant expression or activity of the polypeptide. 

Stimulation of activity is desirable in situations in which activity or expression is 

* 0 abnormally low or downregulated and/or in which increased activity is likely to have a 
beneficial effect. Conversely, inhibition of activity is desirable in situations in which 
activity or expression is abnormally high or upregulated and/or in which decreased activity 
is likely to have a beneficial effect. 

The contents of all references, patents and published patent applications cited 

1 5 throughout this application are hereby incorporated by reference. 

Deposit of Clones 

Clones containing cDNA molecules encoding human MANGO 003 were deposited 
with the American Type Culture Collection (ATCC® 10801 University Boulevard, 
20 Manassas, VA 20110-2209) on March 30, 1999 as Accession Number 207178, as part of a 
composite deposit representing a mixture of three strains, each carrying one recombinant 
plasmid harboring a particular cDNA clone. 

To distinguish the strains and isolate a strain harboring a particular cDNA clone, an 
aliquot of the mixture can be streaked out to single colonies on nutrient medium (e.g. , LB 
plates) supplemented with 1 00 g/ml ampicillin, single colonies grown, and then plasmid 
DNA extracted using a standard minipreparation procedure. Next, a sample of the DNA 
minipreparation can be digested with a combination of the restriction enzymes Sal I and Not 
I, and the resultant products resolved on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digest liberates fragments as follows: 

30 

human MANGO 003 (clone EpthLa6al): 3,2 kB 

The identity of Hie strains can be inferred from the fragments liberated. 

35 Clones containing cDNA molecules encoding human INTERCEPT 340, MANGO 

347, and TANGO 272 were deposited with the American Type Culture Collection (ATCC® 
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10801 University Boulevard, Manassas, VA 20110-2209) on June 18, 1999 as Accession 
Number PTA-250, as part of a composite deposit representing a mixture of three strains, 
each carrying one recombinant plasmid harboring a particular cDNA clone. 

To distinguish the strains and isolate a strain harboring a particular cDNA clone, an 
aliquot of the mixture can be streaked out to single colonies on nutrient medium (e.g., LB 
plates) supplemented with 100 g/ml ampicillin, single colonies grown, and then plasmid 
DNA extracted using a standard minipreparation procedure. Next, a sample of the DNA 
minipreparation can be digested with a combination of the restriction enzymes Sal I and Not 
I, and the resultant products resolved on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digest liberates fragments as follows: 

human INTERCEPT 340 (clone EpI340): 3.3 kB 
human MANGO 347 (clone EpM347): 1.4 kB 
human TANGO 272 (clone EpT272): 5.0 kB 

^ The identity of the strains can be inferred from the fragments liberated. 

Clones containing cDNA molecules encoding human TANGO 295, TANGO 354, 
and TANGO 378 were deposited with the American Type Culture Collection (ATCC® 
10801 University Boulevard, Manassas, VA 20110-2209) on June 18, 1999 as Accession 
Number PTA-249, as part of a composite deposit representing a mixture of three strains, 
each carrying one recombinant plasmid harboring a particular cDNA clone. 

To distinguish the strains and isolate a strain harboring a particular cDNA clone, an 
aliquot of the mixture can be streaked out to single colonies on nutrient medium (e.g., LB 
plates) supplemented with 100 g/ml ampicillin, single colonies grown, and then plasmid 
DNA extracted using a standard minipreparation procedure. Next, a sample of the DNA 
minipreparation can be digested with a combination of the restriction enzymes Sal I and Not 
I, and the resultant products resolved on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digest liberates fragments as follows: 



20 



25 



30 



35 



human TANGO 295 (clone EpT295): 1.5 kB 
human TANGO 354 (clone EpT354): 1.8 kB 
human TANGO 378 (clone EpT378): 3.3 kB 

The identity of the strains can be inferred from the fragments liberated. 
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All publications, patents and patent applications mentioned in this specification are 
herein incorporated by reference into the specification to the same extent as if each 
individual publication, patent or patent application was specifically and individually 
indicated to be incorporated herein by reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
Claims. 
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What is claimed is : 

L An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which is at least 
55% identical to the nucleotide sequence of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 

5 18, 19, 21, 22, 24, 25, 27, 28, 30, the cDNA insert of the plasmid deposited with the 

ATCC® as Accession Number 207178, the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number PTA-249, the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number PTA-250, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 nucleotides of 
10 the nucleotide sequence of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 

24, 25, 27, 28, 30, the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207178, the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-249, the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-250, or a complement thereof; 

^ c) a nucleic acid molecule which encodes a polypeptide comprising the amino 

acid sequence of SEQ ID NOs:2, 5, 8, 11, 14, 17, 20, 23, 26, 29, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207178, the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-249, or the amino acid sequence 

^ encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-250; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NOs:2, 5, 8, 11, 14, 17, 20, 23, 26, 29, the 
amino acid sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® 

25 as Accession Number 207178, the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number PTA-249, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number PTA-250, wherein the fragment comprises at least 15 contiguous amino 
acids of SEQ ID NOs:2, 5, 8, 1 1, 14, 17, 20, 23, 26, 29, the amino acid sequence encoded 

30 by the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 
207178, the amino acid sequence encoded by the cDNA insert of the plasmid deposited 
with the ATCC® as Accession Number PTA-249, or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number PTA-250; 
and 

D 5 e) a nucleic acid molecule which encodes a naturally occurring allelic variant of 

a polypeptide comprising the amino acid sequence of SEQ ID NOs:2, 5, 8, 11, 14, 17, 20, 
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23, 26, 29, the amino acid sequence encoded by the cDNA insert of the plasmid deposited 
with the ATCC® as Accession Number 207178, the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number PTA-249, or 
the amino acid sequence encoded by the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number PTA-250, wherein the nucleic acid molecule hybridizes to a 
5 nucleic acid molecule comprising SEQ ID NOs;l, 3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 
21, 22, 24, 25, 27, 28, 30, or a complement thereof, under stringent conditions, 

2. The isolated nucleic acid molecule of Claim 1, which is selected from the 
group consisting of: 

^ a) a nucleic acid comprising the nucleotide sequence of SEQ ID NOs:l, 3, 4, 6, 

7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 24, 25, 27, 28, 30, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207178, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-249, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-250, or a complement thereof; and 

15 b) a nucleic acid molecule which encodes a polypeptide comprising the amino 

acid sequence of SEQ ID NOs:2, 5, 8, 1 1, 14, 17, 20, 23, 26, 29, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207178, the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-249, or the amino acid sequence 

^ encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-250, 



25 



30 



35 



3. The nucleic acid molecule of Claim 1 further comprising vector nucleic acid 
sequences. 

4. The nucleic acid molecule of Claim 1 further comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of Claim 1 . 

6. The host cell of Claim 5 which is a mammalian host cell. 

7. A non-human mammalian host cell containing the nucleic acid molecule of 
Claim 1. 

8. An isolated polypeptide selected from the group consisting of: 
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a) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID 
NOs:2, 5, 8, 1 1 5 14, 17, 20, 23, 26, or 29, wherein the fragment comprises at least 15 
contiguous amino acids of SEQ ID NOs:2, 5, 8, U, 14, 17, 20, 23, 26, or 29; 

b) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NOs:2, 5, 8, 1 1, 14, 17, 20, 23, 26, or 29* the amino acid sequence 

5 encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207178, the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-249, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-250, wherein the polypeptide is encoded by a nucleic acid molecule which 

10 hybridizes to a nucleic acid molecule comprising SEQ ID NOs: 1, 3, 4, 6, 7, 9, 10, 12, 13, 
15, 16, 18, or a complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a 
nucleotide sequence which is at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 24, 

15 25, 27, 28, 30, or a complement thereof. 

9. The isolated polypeptide of Claim 8 comprising the amino acid sequence of 
SEQ ID NOs:2, 5, 8, 1 1, 14, 17, 20, 23, 26, or 29. 



20 



10. The polypeptide of Claim 8 further comprising heterologous amino acid 
sequences. 

11. An antibody which selectively binds to a polypeptide of Claim 8. 

12. A method for producing a polypeptide selected from the group consisting of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID NOs:2, 5, 8, 
11, 14, 17, 20, 23, 26, or 29, the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207178, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-249, or the amino acid sequence encoded by the cDNA insert of the plasmid 

30 deposited with the ATCC® as Accession Number PTA-250; 

b) a polypeptide comprising a fragment of the amino acid sequence of SEQ ID 
NOs:2, 5, 8, 1 1, 14, 17, 20, 23, 26, or 29, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with the ATCC® as Accession Number 207178, the amino 
acid sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 

35 Accession Number PTA-249, or the amino acid sequence encoded by the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number PTA-250, wherein the 
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fragment comprises at least 15 contiguous amino acids of SEQ ID NOs:2, 5, 8, 11, 14, 17, 
20, 23, 26, or 29, the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207178, the amino acid sequence encoded 
by the cDNA insert of the plasmid deposited with the ATCC® as Accession Number PTA- 
249, or the amino acid sequence encoded by the cDNA insert of the plasmid deposited with 

5 the ATCC® as Accession Number PTA-250; and 

c) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NOs:2, 5, 8, 11, 14, 17, 20, 23, 26, or 29, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207178, the amino acid sequence encoded by the cDNA insert of the plasmid 

1 0 deposited with the ATCC® as Accession Number PTA-249, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-250, wherein the polypeptide is encoded by a nucleic acid molecule which 
hybridizes to anucleic acid molecule comprising SEQ ED NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 
15, 16, 18, 19, 21, 22, 24, 25, 27, 28, 30, or a complement thereof under stringent 

^ conditions; 

comprising culturing the host cell of Claim 5 under conditions in which the nucleic 
acid molecule is expressed. 



20 
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13. A method for detecting the presence of a polypeptide of Claim 8 in a sample, 
comprising: 

a) contacting the sample with a compound which selectively binds to a 
polypeptide of Claim 8; and 

b) determining whether the compound binds to the polypeptide in the sample. 

14. The method of Claim 13, wherein the compound which binds to the 
polypeptide is an antibody. 

15. A kit comprising a compound which selectively binds to a polypeptide of 
Claim 8 and instructions for use. 



30 



16. A method for detecting the presence of a nucleic acid molecule of Claim 1 in 
a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer which selectively 
hybridizes to the nucleic acid molecule; and 

35 

b) determining whether the nucleic acid probe or primer binds to a nucleic acid 
molecule in the sample. 
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1 7. The method of Claim 1 6, wherein the sample comprises mKNA molecules 
and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a nucleic acid 
molecule of Claim 1 and instructions for use. 

5 

19. A method for identifying a compound which binds to a polypeptide of Claim 
8 comprising the steps of; 

a) contacting a polypeptide, or a cell expressing a polypeptide of Claim 8 with 
a test compound; and 
10 b) determining whether the polypeptide binds to the test compound. 

20. The method of Claim 19, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detecting of test compound/polypeptide 
15 binding; 

b) detection of binding using a competition binding assay; 

c) detection of binding using an assay for INTERCEPT 340-, MANGO 003-, 
MANGO 347-, TANGO 272-, TANGO 295-, TANGO 354-, or TANGO 378-mediated 
signal transduction. 

20 

21. A method for modulating the activity of a polypeptide of Claim 8 
comprising contacting a polypeptide or a cell expressing a polypeptide of Claim 8 with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 

25 

22. A method for identifying a compound which modulates the activity of a 
polypeptide of Claim 8, comprising: 

a) contacting a polypeptide of Claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 



35 
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Input file I340Athso102bT2; Output File l340Athso102b12.pat 
Sequence length 3284 

GTCGACCCACGCGTCCGTTATGTAACTATACATTTTCCCAGAAATTTTAGTATATGATATGATTTTGTTTTCTTTCATC 79 

CCTTTTCCCAAGCAGTTTATTATGAAAATTTTCAAACATACAGCAATGTTGAGAAAATTTTACAGTAAATGCCTATACC 158 

CATTACCTAAATTTTACCATTAACATTTTACCCTGCTGGCATTATTGTGCTTATCCATCTACGTATCCCTCTCTCCCTT 237 

CATTGGTGTATTTCTAAGTAAATTGTAGGCCTCAGTACACTTCCTTCTGAATTCTTCAGCATGCACAACAGTATTATAT 316 

TCCATTTTTAAAAGAGCAATTCTTGATAGATTTATATAGTTTTGTAAAATGTTCATATAGAGCTACAAATTTTATCTTT 395 

TTGTTTCTTATTGTATGTCTAGGGTCCTGAAGGGGATGCTGGCATTGTTGGGATATCAGGTCCTAAAGGTCCTATTGGA 474 

CACAGAGGAAACACTGGTCCCCTTGGCAGAGAAGGTATMTAGGCCCAACAGGTAGAACTGGACCCAGAGGTGAAAAGG 553 

GCTTTAGAGGTGAAACTGGTCCTCAAGGACCAAGAGGTCAACCAGGGCCTCCAGGTCCACCTGGAGCACCAGGCCCAAG 632 

AAAGCAAATGGATATCAATGCTGCTATTCAAGCCTTGATTGAATCAAATACTGCCCTACAGATGGAGGTAACATATCTG 71 1 

GTTTTATTTATATTGGCACTGTCTCTCAATATACCAATTAAACAGAGAAAATTTTTGGAGGCCAAAATGTGACATTATC 790 

TCAAAGATTGTATTTAAAACAGATTGAAAATGTGAAACCATTCTCAAGAACAAAGTAAGTGATTTTGGTATAATTAAAC 869 

AGAAATATATGCGTAGGATGTTTTGTAAGGAAAACATTTAAATCAAAAATTTAGTACTGTTATTTGTAAGGAATTTGGT 948 

ACTATCCAAGAAAGTAGTTAAATGAGGTTAGCCATGTTTCTTAAAATGAGATATATATATTATCACTACTCATTTATTT 1027 

AAACTCTAATGATTCAATGTGTAATTTAAAAAACATAATACAGTAGACATAGCAATTCTTATGTTAGCTTGAAAACTAA 1106 

ACTTGCAAATGTGAATTTAACCTC T TTAAAAGAT TAAGGTTAT TAAAGCATACACATATGCCTATGCTTAAATATAAAC 1 1 85 

METHSSPALA 10 

TGTTCTTTACATTCTACTCACAACTTACTACACATA ATG GAA ACA CAT TCT TCT CCT GCC TTG GCC 1251 

HV GPQDFFV Y I I LMM T WQSY 30 

CAT GTT GGT CCT CAG GAT TTT TTT GTT TAT ATA ATT CTT ATG ATG ACT TGG CAG AGC TAC 1311 

QNTEVTL IDHSEE IFKTLNY 50 

CAG AAT ACT GAA GTG ACT TTA ATT GAC CAC AGT GAA GAG ATA TTC AAA ACC CTG AAC TAC 1371 



FIG.1A 
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LSNLLH S I KNPL GTRDNPAR 70 
CTT AGC AAT TTA TTG CAC AGC ATC AAG AAT CCT CTT GGC ACA CGA GAT AAC CCA GCA CGA 1431 

ICKDLLNCEQK V SDGKYW! D 90 

ATC TGC AAA GAT TTA CTT AAC TGT GAA CAA AAA GTA TCA GAT GGA AAA TAC TGG ATT GAC 1491 

PNLGCPSDA I EVFCNF SAGG 110 

CCA AAT CTT GGC TGT CCT TCA GAT GCC ATT GAG GTT TTC TGC AAT TTC AGT GCT GGT GGC 1551 

QTC L P PVS V.TKL EFGVGKVQ 130 

CAG ACA TGC TTA CCT CCT GTT TCT GTA ACA AAG TTG GAG TTT GGA GTT GGG AAA GTC CAG 1611 

MNFLHLLSSEATH I IT IHC L150 

ATG AAC TTC CTT CAT TTA CTG AGT TCG GAA GCC ACC CAT ATC ATC ACC ATT CAC TGT CTA 1671 

NTPRWTST QTSGP GLP IGF K 170 

AAC ACC CCA AGG TGG ACA AGC ACA CAA ACA AGT GGC CCA GGA TTG CCT ATT GGT TTC AAG 1731 

GW NGQ I FKVNTLLEPKVLS D 190 

GGA TGG AAT GGC CAG ATT TTT AAA GTA AAC ACT CTA CTT GAA CCT AAA GTG CTT TCA GAT 1791 

0 C K I Q D G S W H K A T F L F H T Q E 210 

GAC TGC AAG ATT CAA GAT GGC AGC TGG CAT AAG GCA ACA TTT CTT TTT CAC ACC CAG GAA 1851 

PNQLPV IEVQ KLP H LKT ERK 230 

CCT AAT CAA CTT CCA GTG ATT GAA GTA CAA AAA CTT CCT CAT CTC AAA ACT GAA CGA AAG 1911 

Y Y I DSSSVCFL* 242 

TAT TAC ATT GAC AGC AGT TCT GTA TGC TTT CTG TAA 1947 

AGTCTCTGAATTAGTTCCGAATTCAGGCTGTTGGCCAGGTAATTGCTGCAGAGGGAGAAATAAGACAGACAGATACAGT 2026 

CATTATGAAATGCATGTAATAAAGCATTGGCTAAATCTTAAAGAATCTCAGGAAGAACAGACTTCCTCCTAAGAAGGAG 2 1 05 

AAAAGGCATTTTTAAAGGACTATGATTGATAAAGTATTTAATTCTTTTAAAAATTATATTCATCTCAGCTTTCTTAGAG 2184 

AATTCCCTAGAACTAAAAATTTATAAATATGGAATTCTTCAGGGTATCTTATATTTTTGACTGAGTGCGTAGTACCCAT 2263 

TAGACAGCTGGAGATGCAGAGCACTATGGAGCAATACTGGCTAATGCTTCCAGATGTGCACTGCTTCTGTCTAAAAATT 2342 

ACAAGCCACAGTCTAATATGTCTTATTTTCCAAAACACTAAGCTGTATTCAGGTCCCCGATGGGCATATACATCTTAGC 2421 

FIG.1B 
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CGGTGATACACTACCTCTTACGTGTTGCCTCTTTGTGTTGCTTGGTGCTCTTTCGAAAACAAGGTGCTTATGGCTTTCA 2500 

TAGACTATTTCCTTTTTCATCTTTGTCAT7CTTTAAAAGTGTATGTACTGGTTACATCAAGATATGTTTTGGTTGTTAG 2579 

TACTTATTTTAATTTGTTTGGTCACACACTTAATAACACATGAAACTATTTATGTGAAGTCCTTGTTTTATTTTAAAAT 2658 

TCrCTTTGTGTATTTGGAATCAAAGCCAGCACATTGTAACCTGTGCTTGTACGCAAAAGAATTAGATTTCTTTGTTTTT 2737 

GTTTTATTTTTTAAATTGTTGTAAAAATTATTATAGGCCAGCTACATCTAGTAGTAGGTTTGGGGTACAGATTGGGGGT 2816 

TG TGCCATACTG T T T TT A AAG T TCATG ATCATCTGGAATG ATACT TAG TG T ATATAT ATTTTG TAAAG T TT T AAT TCAG 2895 

CAAATTTTTTGAAATTGCTGCTGTTTTAAATTATAAAACCTTTATATTTCTGCTT7GTAGAAATTATATGTTTTGTAGT 2974 

ATTCATTGATTTTCTTTCACTGTACTTAAATTTAGTGTTAGTACTTTAAAATTTTTAATTTACCAGTCTTTAAAGCAAC 3053 

ATCCAGAAAAAAAAAAGTCTTTTCCCATTTAAAATAGGCTCAGCCAGTTCAATGTCGCCTTGTTATCAGAGAAATATTA 3132 

GTTCAATACTGAAAGAAAAATATTATAGCTCTTGGTATCTAGAAAAGCTTGTTCATCCATTATAAATATATCTTTAGCC 3211 

ACAGCAAACCACACTTAACCTATCTATAATAAAAATGTGCTTTAAATAMAAAAAAAAAAAAAAGGGCGGCCG 3284 
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COLFI C0LFI C0LFI 



Cys 
Ngly 



1 1 1 1 1 i i 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 41 81 121 161 201 
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COLFI: domain 1 of 3, from 58 to 116: score 110.3. E = 1.3e-42 

*-> I ksPeGksr knPARtCkDLf Lchpe f ksGeYWi DPNqGC i kDAikVf 
+k+P4G +r+nPAR CkDL c + +4G YWiDPN+GC4 DAi+Vf 
INT340 58 1 KNPLG-TRDNPAR I CKDL LNCEQKVSDGKYWI DPNLGCPSDA i EVF 103 

CnkrfetGvgeTCisp<-» 
Cn f 4G g+TC +p 
INT340 104 CN— FSAG-GQTCLPP 116 



COLF [ : domain 2 of 3. from 126 to 151: score 9.7, E = 0.0028 

*->i snvQI TFLRLLSteAsQN i HhCKN<-* 
+++vQ+ FL LLS+eA FT hC N 
INT340 126 VGKVQMNFLHLLSSEATHI ITIHCLN 151 



COLFI: domain 3 of 3, from 186 to 217: score 5.8, E = 0.09 

*->t v IGeDGCss r tgewgKTVi EyeTkKt tRLP I v<-# 
+vl D C+ g w K+ + + T+ + +LP + 
1NT340 186 KVL-SDDCK 1 QDGSWHKATF LFHTQEPNQLPV I 217 



FIG.3 
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Input file M003Athya30d3; Output File M003A thYo30d3 . pat 
Sequence length 3169 

M T P S P 5 

GTCGACCCACGCGTCCGCGCCCGCTGAGCCCCCCGCCGAGGTCCGGACAGGCCGAG ATG ACG CCG AGC CCC 71 

LLLLLLPPLLLGAFPPAAAA 25 

CTG TTG CTG CTC CTG CTG CCG CCG CTG CTG CTG GGG GCC TTC CCG CCG GCC GCC GCC GCC 131 

RG P PKMADKVVPRQVARLGR 45 

CGA GGC CCC CCA AAG ATG GCG GAC AAG GTG GTC CCA CGG CAG GTG GCC CGG CTG GGC CGC 191 

TV RL QCPVEGDPPPLTMW T K 65 

ACT GTG CGG CTG CAG TGC CCA GTG GAG GGG GAC CCG CCG CCG CTG ACC ATG TGG ACC AAG 251 

DGRT I HSGWSR FRVLPQGL K 85 

GAT GGC CGC ACC ATC CAC AGC GGC TGG AGC CGC TTC CGC GTG CTG CCG CAG GGG CTG AAG 311 

V KQVEREDAGVYVCKATNG F 105 

GTG AAG CAG GTG GAG CGG GAG GAT GCC GGC GTG TAG GTG TGC AAG GCC ACC AAC GGC TTC 371 

GSLSV-N. YTLVVL-DD I SP'GKE 125 

GGC AGC CTG AGC GTC AAC TAC ACC CTC GTC GTG CTG GAT GAC ATT AGC CCA GGG AAG GAG 431 

S LGPD SSSGGQEDPASQQWA 145 

AGC CTG GGG CCC GAC AGC TCC TCT GGG GGT CAA GAG GAC CCC GCC AGC CAG CAG TGG GCA 491 

RPRFTQPSKMRRRVIA RPVG 165 

CGA CCG CGC TTC ACA CAG CCC TCC AAG ATG AGG CGC CGG GTG ATC GCA CGG CCC GTG GGT 551 

SSVR LKCVASGHPR'PD I TWM 185 

AGC TCC GTG CGG CTC AAG TGC GTG GCC AGC GGG CAC CCT CGG CCC GAC ATC ACG TGG ATG 611 

KDDQA t TRP EAAEPRKKK'WT 205 

AAG GAC GAC CAG GCC TTG ACG CGC CCA GAG GCC GCT GAG CCC AGG AAG AAG AAG TGG ACA 671 

LS LKNL RPEDSGKY T CRVSN 225 

CTG AGC CTG AAG AAC CTG CGG CCG GAG GAC AGC GGC AAA TAC ACC TGC CGC GTG TCG AAC 731 

RAGAINATYKVDVIQRTRSK 245 

CGC GCG GGC GCC ATC AAC GCC ACC TAC AAG GTG GAT GTG ATC CAG CGG ACC CGT TCC AAG 791 

FIG.4A 
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PVLTGTHPVNTTVDFGGTTS 265 

CCC GTG CTC ACA GGC ACG CAC CCC GTG AAC ACG ACG GTG GAC TTC GGG GGG ACC ACG TCC 851 

FQCKVR SDVKPVI QWL KR VE 285 

TTC CAG TGC AAG GTG CGC AGC GAC GTG AAG CCG GTG ATC CAG TO CTC AAG CGC GTG GAG 911 

YGAEGRHNST I DVGGQKFVV 305 

TAC GGC GCC GAG GGC CGC CAC AAC TCC ACC ATC GAT GTG GGC GGC CAG AAG TTT GTG GTG 971 

L P TGD VWSRPDGSYLNKLL I 325 

CTG CCC ACG GGT GAC GTG TGG TCG CGG CCC GAC GGC TCC TAC CTC AAT AAG CTC CTC ATC 1031 

TR ARQD DAGMY I CLGANTMG 345 

ACC CGT GCC CGC CAG GAC GAT GCG GGC ATG TAC ATC TGC CTT GGC GCC AAC ACC ATG GGC 1091 

Y SFRSAF L TV LPD PKPPGPP 365 

TAC AGC TTC CGC AGC GCC TTC CTC ACC GTG CTG CCA GAC CCA AAA CCG CCA GGG CCA CCT 1151 

VA SSSSATSLPWPVV1GIPA 385 

GTG GCC TCC TCG TCC TCG GCC ACT AGC CTG CCG TGG CCC GTG GTC ATC GGC ATC CCA GCC 121 1 

GAVF I IGTL LLWLCQA QKKP 405 

GGC GCT GTC TTC ATC CTG GGC ACC CTG CTC CTG TGG CTT TGC CAG GCC CAG AAG AAG CCG 1271 

CTPAPAPPLPGHR. PPGTARD 425 

TGC ACC CCC GCG CCT GCC CCT CCC CTG CCT GGG CAC CGC CCG CCG GGG ACG GCC CGC GAC 1331 

R S G D K D L P S L A A L S A G P G V G 445 

CGC AGC GGA GAC AAG GAC CTT CCC TCG TTG GCC GCC CTC AGC GCT GGC CCT GGT GTG GGG 1391 

LC EEHGSPAAPQHLLGPGP V 465 

CTG TGT GAG GAG CAT GGG TCT CCG GCA GCC CCC CAG CAC TTA CTG GGC CCA GGC CCA GTT 1451 

AG PK LYP KLYT O IHT H THTH 485 

GCT GGC CCT AAG TTG TAC CCC AAA CTC TAC ACA GAC ATC CAC ACA CAC ACA CAC ACA CAC 151 1 

SH THSHVE GKVHQHI HYQC * 505 

TCT CAC ACA CAC TCA CAC GTG GAG GGC AAG GTC CAC CAG CAC ATC CAC TAT CAG TGC TAG 1571 

ACGGCACCGTATCTGCAGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGCTGCAGACG 1650 

FIG.4B 
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AAGGCAGGGGACCCATGGCGAGGAGGAATGGCCAGCACCCCAGGCAGTCTGTGTGTGAGGCATAGCCCCTGGACACACA 1729 

CACACAGACACACACACTGCCTGGATGCATGTATGCACACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGC 1808 

ACACACGCACATGCACAGATATGCCGCCTGGGCACACAGATAAGCTGCCCAAATGCACGCACACGCACAGAGACATGCC 1 887 

AGAACATACAAGGACATGCTGCCTGAACATACACACGCACACCCATGCGCAGATGTGCTGCCTGGACACACACACACAC 1966 

ACGGATATGCTGTCTGGACGCACACACGTGCAGATATGGTATCCGGACACACACGTGCACAGATATGCTGCCTGGACAC 2045 

ACAGATAATGCTGCCTTGACACACACATGCACGGATATTGCCTGGACACACACACACACACGTGTGCACAGATATGCTG 2 1 24 

TCTGGACACGCACACACATGCAGATATGCTGCCTGGACACACACTTCCAGACACACGTGCACAGGCGCAGATATGCTGC 2203 

CTGGACACACGCAGATATGCTGTCTAGTCACACACACACGCAGACATGCTGTCCGGACACACACACGCATGCACAGATA 2282 

TGCTGTCCGGACACACACACGCACGCAGATATGCTGCCTGGACACACACACAGATAATGCTGCCTCAACACTCACACAC 2361 

GTGCAGATATTGCCTGGACACACACATGTGCAGAGATATGCTGTCTGGACATGCACACACGTGCAGATATGCTGTCCGG 2440 

ATACACACGCACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGACACACATGCACACACAGGTGCAGATAT 2519 

GCTGCCTGGACACACGCAGACTGACGTGCTTTTGGGAGGGTGTGCCGTGAAGCCTGCAGTACGTGTGCCGTGAGGCTCA 2598 

TAGTTGATGAGGGACTTTCCCTGCrCCACCGTCACTCCCCCAACTCTGCCCGCCTCTGTCCCCGGCTCAGTCCCCGCCT 2677 

CCATCCCCGCCTCTGTCCCCTGGCCTTGGCGGCTATTTTTGCCACCTGCCTTGGGTGCCCAGGAGTCCCCTACTGCTGT 2756 

GGGCTGGGGTTGGGGGCACAGCAGCCCCAAGCCTGAGAGGCTGGAGCCCATGGCTAGTGGCTCATCCCCACTGCATTCT 2835 

CCCCCTGACACAGAGAAGGGGCCTTGGTATTTATATTTAAGAAATGAAGATAATATTAATAATGATGGAAGGAAGACTG 2914 

GGTTGCAGGGACTGTGGTCTCTCCTGGGGCCCGGGACCCGCCTGGTCTTTCAGCCATGCTGATGACCACACCCCGTCCA 2993 

GGCCAGACACCACCCCCCACCCCACTGTCGTGGTGGCCCCAGATCTCTGTAATTTTATGTAGAGTTTGAGCTGAAGCCC 3072 

CGTATATTTAATTTATTTTGTTAAACATGAAAGTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 3151 

AAAAAAAAGGGCGGCCGC 3169 

FIG.4C 
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ig: domain 1 of 3, from 44 to 101: score 36.4, E = 9.9e-10 

*~>GesvtL tCsvsgf gpp . p . vtWI rngk Islti.s 

G +V+L+C v g+p+p W-H4g++ +++++ + + I ++ 
M003 44 GRTVRLQCPVE— GDPpPI TMWTKOGRt i hsgwsr f rv IpQGLKVkQ 88 

vtpeDsgGtYtCvv<~* 
v++eD+ G+Y C + 
M003 89 VEREDA-GVYVCKA 101 



ig: domain 2 of 3, from 165 to 223: score 48.9, E - 1.3e-13 

*->GesvtLtCsvsgfgpp.p.vtWlrngk; > . I si ti . 

G+sv+L C +s g p+pHtW ++++ ++++ +++-H+ +1 44 
M003 165 GSSVRL KCVAS — GHPr Pd I TWMKDDQa 1 1 rpeaaeprkkkWTLSLk 209 

svtpeDsgGtYtCvv<-« 
+++peDs G YtC+v 
M003 210 NLRPEDS-GKYTCRV 223 



ig: domain 3 of 3, from 261 to 340: score 26.9, E = 8.8e-07 

*->GesvtLtCsvsgf gpp.p. vtWIrngk 

G++ +++C v+ 4+ +p ++WI+ + + fill Ml + ++++ 
M003 261 GGTTSFQCKVR— SDVkPv IQWLKRVEygaegrhns t idvggqkf vv 305 

lslti.svtpeDsgGtYtCvv<-* 

Htt +4+H4 I+I+-HH-++D+ G Y C 
M003 306 I ptgdvwsrpdgsy I NKLL I tRARQDDA-GMYICLG ■ 340 



FIG.6 
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neur_chon: domoin 1 of 1, from 388 to 397: score 1.4, E * 9.7 

*->vfvlGTIgif<-* 
vf+IGTI ++ 
M003 388 VF1LGTLLU 397 



FIG.7 
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Input file M003jfmjf004c11; Output File M003 j fmj f 004c1 1 .pat 
Sequence length 1074 

RV RPTG DVWS RPDGSYLNK 19 

CA CGC GTC CGG CCC ACG GGT GAT GTG TGG TCA CGG CCT GAT GGC TCC TAC CTC AAC AAG 59 

LLISRARQDDAGMYICLGAN 39 

CTG CTC ATC TCT CGG GCC CGC CAG GAT GAT GCT GGC ATG TAC ATC TGC CTA GGT GCA AAT 119 

T MGYSFRSAF L TVLPDPKPP 59 

ACC ATG GGC TAC AGT TTC CGT AGC GCC TTC CTC ACT GTA TTA CCA GAC CCC AAA CCT CCA 179 

GPP MASSS SSTSLPWP VVIG 79 

GGG CCT CCT ATG GCT TCT TCA TCG TCA TCC ACA AGC CTG CCA TGG CCT GTG GTG ATC GGC 239 

I PAG AVF 1 LGTVLLWLCQTK 99 

ATC CCA GCT GGT GCT GTC TTC ATC CTA GGC ACT GTG CTG CTC TGG CTT TGC CAG ACC AAG 299 

K KPCAP ASTLPVPGHRPPGT 119 

AAG AAG CCA TGT GCC CCA GCA TCT ACA CTT CCT GTG CCT GGG CAT CGT CCC CCA GGG ACA 359 

SRERSGDKDL P SL AVG I CE E 139 

TCC CGA GAA'CGC AGT GGT GAC AAG GAC CTG CCC TCA TTG GCT GTG GGC ATA.'TGT GAG GAG 419 

HGSAMAPQH 1 L ASGSTAGPK 159 

CAT GGA TCC GCC ATG GCC CCC CAG CAC ATC CTG GCC TCT GGC TCA ACT GCT GGC CCC AAG 479 

LYPKLYTDVH.THTHTHTCTH 179 

CTG TAC CCC AAG CTA TAC ACA GAT GTG CAC ACA CAC ACA CAT ACA CAC ACC TGC ACT CAC 539 

TLSCWRARF 1NTSMST I SAK199 

ACG CTC TCA TGT TGG AGG GCA AGG TTC ATC AAC ACC AGC ATG TCC ACT ATC AGT GCT AAA 599 

YSE SPSTVS* 209 

TAC AGC GAA TCT CCA AGC ACT GTG TCC TGA 629 



FIG.8A 
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GGT AGGCAT TTGGGGGCCA AGGCAACAGGT TGGG AGA AT TGAGAACAATGGAGGAAG AGTATCTTAGGGTGCCT T ATGG 708 

TGGACACTCACAAACTTGGCCATATAGATGTATGTACTACCAGATGAACAGCCAGCCAGATTCACACACGCACATGTTT 787 

AAACGTG TAAACG TG TGCACAACTGCACACACAACCTGAGAAACCT TCAGGAGGATTTGTGGTGTGACTT TGCAGTGAC 866 . 

ATGTAGCGATGGCTAGTTGAAGGAATCTCCCTCATGTCTTAGTGGTCATGGCCACTTCCCCACCCCTGCCCATCTGTGT 945 

TCCTGCCTGGCCTTGGTGGTGCTTCCGTGTGCCCTGGGTTTTCCAGGAACCCTATCAACCTGACTGGGGTGAGCAGTGC 1024 

AGCCATGCNTGGAGGTTTGAGCCACCCTCCCCTTGCTAGAGAGAAGGGCN 1074 

FIG.8B 
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PFAM 




Cys 
Ngly 



ii i ii 
i 



I , | . | . | , | . | . , . | i | . | ■ | 
1 41 81 121 161 201 
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Input file M347Alhbod295g12; Output File M347AIhbad295g12.pat 
Sequence length 1423 

MPGPRVWGKYLW 12 

GTCGACCCACGCGTCCGCCCACGCGTCC6G ATG CCT GGA CCC AGA GTG TGG GGG AAA TAT CTC TGG 66 

RSPH.SKGCPG A. MWWL LLWGV 32 

AGA AGC CCT CAC TCC AAA GGC TGT CCA GGC GCA ATG TGG TGG CTG CTT CTC TGG GGA GTC 126 

LQACPTRGSVLLAQELPQ QL 52 

CTC CAG GCT TGC CCA ACC CGG GGC TCC GTC CTC TTG GCC CAA GAG CTA CCC CAG CAG CTG 186 

TSPGYPE P YGKGQESSTD I K 72 

ACA TCC CCC GGG TAC CCA GAG CCG TAT GGC AAA GGC CAA GAG AGC AGC ACG GAC ATC AAG 246 

APEGF AVRLVFQDFDLEP SQ 92 

GCT CCA GAG GGC TTT GCT GTG AGG CTC GTC TTC CAG GAC TTC GAC CTG GAG CCG TCC CAG 306 

OCAG D SVTVSWGWGG S RQ DC 112 

GAC TGT GCA GGG GAC TCT GTC ACA GTG AGC TGG GGA TGG GGG GGG TCC CGC CAG GAC TGT 366 

GQ GDSRGCGKW RC PE SP I WR 132 

GGC CAG GGA GAT TCC CGG GGT TGT GGG AAG TGG CGG TGC CCT GAA TCC CCC ATC TGG AGG 426 

R D £ F S M * 139 

AGG GAT GAA TTT TCC ATG TAG 447 

GGGCAGTCGGGCTTGGCTTACCGGGGAGCAGTGGTGGACCCCAGGACACAGCCTCCCACCAGCGCCTCCGGGGCTGCCA 526 

TCTGGGCCCCACAGAGCAAAGAGGGCAGCAAGCAGGCCCTGCGTTTGGAAGGCTTATGAATGGACACACAAATCTTGCA 605 

AATCTATGGAGCCAGGGGCAGGGACGCACATATTGGTTGTTAAAAATATGTCATCATGTATTTGTTGAGTGCCTGCTCT 684 

ATCAGGTGAGGAAGCTGGACACAAATAATAACAAAAGATTAAGTCACCGTTCACACTTACCTTGGAAGAGCTATTACAA 763 

AACTTCTAACGCCAAAGCCTTATTCAGMTAAGGACATTTTAAAAACAGTACTTGATGGAGTGATGCAAGCTTGCAGTC 842 

CCAGCAGTATAGTCAGGAGACTGAGGCTGGAGGATCAGAGGGCTGGAGCCCAGGGTTCAAGGCCAGCCTAAGCAACATA 921 

FIG.10A 
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GCAAGACCCCATCTCAAAAATAAGTAAATAATAAATAAAAATAAAAAGAGCACATTATCTTTTGATTTAAATTTTATTT 1000 

ATATCAAAATGACATAAATTTTTGAACTTTATTTTTTAATTTTAAAATTTTTMTTATTATGGATACATAATAGTTGTA 1079 

AGACTTTTTGTTTTTTAATTAAAGTTTTCTAAGGCTGGGCGCAGTAGCTCATGTCTGTAGTCCCAGCACTTTGGGAGGC 1158 

TGAGGCGAAAGAAGCACTTGAGCCCAGGAATTTGAGACCAGCCTGGGCAACATAGCAAGACCCCATCTCTACAAAAAAA 1237 

TTTAAAAATTAGCCAAGTGTGGTGGCACGCACCTGTGGTCCCAGCTACAAGGGACGCTGAAGTGAGAGGATCACTTGAG 1316 

CCTGGAAGGTAGAGGCTGCAGTGAGCTCTGATCATGACACCGTACTCCAGCCTGGGTGACAGAGTGAGACCCTGTCTCC 1395 

AAAAAAAAAAAAAAAAAAGGGCGGCCGC 1423 

FIG.10B 
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M347 



Cys 

out 

TM 

ins 



CUB 



■ i i i ii 



i ' i * i ' i • i ■ i ' i ■ 

1 41 81 121 

FIG.1 1 
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CUB: domoin 1 of 1, from 40 to 136: score -17.7, E = 0.035 

*->CGgt I d I tessGs i sSPnYPnrsdYppnkeCvWr I rappqyr vVeLt 
G +1+ +e + ++SP+YP+ +Y +e I op+g+ V L 
hM347 40 -GSVLL AQELPQQL TSPGYPE- - PYGKGQESSTO I KAPEGFA-VRLV 82 

FqdFdlEdhdgapCryDyvEirOGdpss.pl IG. . . .rfCG. . . .sgkPe 
FqdFdIE +++ C4 0+v + G ++S++ G444+r CG+ + ++P 
HM347 83 FQDFDLEPSQD — CAGDSVTVSWGVKGGSrQDCGqgdsRGCGkwrcPESP- 129 

dirStsnrml ikFvsOasvskrGFkAty<-* 
+ +D+ + 
hM347 130 IWRROE F 136 



FIG. 12 
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Input file T272Athdo89h3; Output File T272Athdo89h3.pat 
Sequence length 5036 

GTCGACCCACGCGTCCGCTCGAAGCGGGGACCCTCGCCCCGTCCTCGGCTGTCCAGTCCTCCTCCTCGCAGACCCCGGC 79 

GGTTCCTACCCCAGGCCGCAGGGGAGACGGTGCCCCAAGGCAGGCTTCATATCCTGAACGCTGGGATCCCCCAGGACAT 158 

M S 2 

TCCCTGGCCCCCAGGCCCCAGGTCCCAGGCCCCAGGGCTGAGCTGTGGGCAGGCCCCACCTGGCCTCTGCA ATG TCA 235 

PPL CPL LLLA VG LRLAGTLN 22 

CCG CCT CTG TGT CCC CTC CTT CTC CTG GCT GTG GGC CTG CGG CTG GCT GGA ACT CTC AAC 295 

PS D PN TCSF W E S F T T T TKE S 42 

CCC AGT GAT CCC AAT ACC TGC AGC TTC TGG GAA AGC TTC ACT ACC ACC ACC AAG GAG TCC 355 

HSRPF S LLPSEPCERPWEGP 62 

CAC TCC GGC CCC TTC AGC CTG CTC CCC TCA GAG CCC TGC GAG CGG CCC TGG GAG GGC CCC 415 

HTCPSPQTQRKLLASR DSFC 82 

CAT ACT TGC CCC AGC CCA CAA ACT CAG AGG AAA CTC CTG GCT TCT AGG GAT TCA TTC TGC 475 

MVCV GAGVQWRDRSAL QPQT 102 

ATG GTC TGT GTC GGG GCT GGA GTG CAG TGG CGA GAT CGT AGT GCA CTG CAA CCT CAA ACA 535 



GN AL SMRPQ'PRVL SGA PSL A 122 
GGG AAT GCG CTT TCT ATG CGC CCT CAG CCC AGA GTG TTG AGT GGT GCC CCT TCC CTG GCC 595 



SPGHTVVVKTDHRQRLQCCH 142 

TCC CCT GGC CAC ACT GTG GTG GTG AAG ACG GAC CAC CGC CAG CGC CTG CAG TGC TGC CAT 655 

GFYES RG FCVPLCAQE CVHG 162 

GGC TTC TAT GAG AGC AGG GGG TTC TGT GTC CCG CTC TGT GCC CAG GAG TGT GTC CAT GGC 715 

RCVAPNQC QCV PGWRGDD C S 182 

CGT TGT GTG GCA CCC AAT CAG TGC CAA TGT GTG CCA GGC TGG CGG GGC GAC GAC TGT TCC 775 



SAPNCLQPCTPGYYGPAC QF 202 
AGT GCC CCG AAC TGC CTT CAG CCC TGT ACC CCT GGC TAC TAT GGC CCT GCC TGC CAG TTC 835 



FIG.13A 
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RCQ C HGAPCDPQ TG ACF CP A 222 

CGC TGC CAG TGC CAT GGG GCA CCC TGC GAT CCC CAG ACT GGA GCC TGC TTC TGC CCC GCA 895 

E R T G P S C D V S C S Q G T S G F F C 242 

GAG AGA ACT GGG CCC AGC TGT GAC GTG TCC TGT TCC CAG GGC ACT TCT GGC TTC TTC TGC 955 

PST H PCQ NG GVFQTPQ GSCS 262 

CCC AGC ACC CAT CCT TGC CAA AAT GGA GGT GTC TTC CAA ACC CCA CAG GGC TCC TGC AGC 1015 

CP PGWM G T I CSLPCPEGFH G 282 

TGC CCC CCT GGC TGG ATG GGC ACC ATC TGC TCC CTG CCC TGC CCA GAG GGC TTT CAC GGA 1075 

PNCSQE CR CHNG GLCDRFTG 302 

CCC AAC TGC TCC CAG GAA TGT CGC TGC CAC AAC GGC GGC CTC TGT GAC CGA TTC ACT GGG 1135 

Q CRC AP GYTGDRC R E E CPVG 322 

CAG TGC CGC TGC GCT CCG GGT TAC ACT GGG GAT CGG TGC CGG GAG GAG TGC CCG GTG GGC 1195 

R F G Q 0 C A ETC D C A P D A R C F P 342 

CGC TTT GGG CAG GAC TGT GCT GAG ACG TGC GAC TGC GCC CCG GAC GCC CGT TGC TTC CCG 1255 

ANGA CLCEHGFTGDRCTDRL 362 

GCC AAC GGC GCA TGT CTG TGC GAA CAC GGC TTC ACT GGG GAC CGC TGC ACG GAT CGC CTC 1315 

CPD GFYGLSCQAPCTCDREH 382 

TGC CCC GAC GGC TTC TAC GGT CTC AGC TGC CAG GCC CCC TGC ACC TGC GAC CGG GAG CAC 1375 

S LSCHP MNGECSCLPGWAGL 402 

AGC CTC AGC TGC CAC CCG ATG AAC GGG GAG TGC TCC TGC CTG CCG GGC TGG GCG GGC CTC 1435 

HCNE SCPQD TH GPG C QE H CL 422 

CAC TGC AAC GAG AGC TGC CCG CAG GAC ACG CAT GGG CCA GGG TGC CAG GAG CAC TGT CTC 1495 

CLHGGVC QATSG LCQC APGY 442 

TGC CTG CAC GGT GGC GTC TGC CAG GCT ACC AGC GGC CTC TGT CAG TGC GCG CCG GGT TAC 1555 

TG PHCASLCPPDT YG VNCSA 462 

ACG GGC CCT CAC TGT GCT AGT CTT TGT CCT CCT GAC ACC TAC GGT GTC AAC TGT TCT GCA 1615 

FIG.13B 
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RCSC ENA I ACSP I DGE CVC K 482 

CGC TGC TCA TGT GAA AAT GCC ATC GCC TGC TCA CCC ATC GAC GGC GAG TGC GTC TGC AAG 1675 

EGW QRGNCSVP CPPGTWGFS 502 

GAA GGT TGG CAG CGT GGT MC TGC TCT GTG CCC TGC CCA CCC GGA ACC TGG GGC TTC AGT 1735 

C NAS CQCAHEAVCS PQTGAC 522 

TGC AAT GCC AGC TGC CAG TGT GCC CAT GAG GCA GTC TGC AGC CCC CAA ACT GGA GCC TGT 1795 

T CT PGWHGAHCQLPCP KGQF 542 

ACC TGC ACC CCT GGG TGG CAT GGG GCC CAC TGC CAG CTG CCC TGT CCG AAG GGG CAG TTT 1855 

G E G C A S R C D C D H S D G C 0 P V.H 562 

GGA GAA GGT TGT GCC AGT CGC TGT GAC TGT GAC CAC TCT GAT GGC TGT GAC CCT GT-T CAT 1915 

GRCQCQAGWMGARCHL SCPE 582 

GGA CGC TGT CAG TGC CAG GCT GGC TGG ATG GGT GCC CGC TGC CAC CTG TCC TGC CCT GAG 1975 

GLWGVNCSNTCTCKN GGTCL 602 

GGC TTA TGG GGA GTC AAC TGT AGC AAC ACC TGC ACC TGC AAG AAT GGG GGC ACC TGT CTC 2035 

PENGNCVCAPG F RGPSCQRS 622 

CCT GAG AAT GGC AAC TGC GTG TGT GCA CCC GGA TTC CGG GGC CCC TCC TGC CAG AGA TCC 2095 

C QPGRYGKR CVPCKCAN HSF 642 

TGT CAG CCT GGC CGC TAT GGC AAA CGC TGT GTG CCC TGC AAG TGC GCT AAC CAC TCC TTC 2155 

CHPSNGTCYCL AGWTGPDCS 662 

TGC CAC CCC TCG AAC GGG ACC TGC TAC TGC CTG GCT GGC TGG ACA GGC CCC GAC TGC TCC 2215 

QP CPPGHWGENCAQTC QC HH 682 

CAG CCA TGC CCT CCA GGA CAC TGG GGA GAA AAC TGT GCC CAG ACC TGC CAA TGT CAC CAT 2275 

GGTC HPQDGSC ICPLGWTG H 702 

GGT GGG ACC TGC CAT CCC CAG GAT GGG AGC TGT ATC TGC CCC CTA GGC TGG ACT GGA CAC 2335 

HC L E'GC P L G T FGANCS QPCQ 722 

CAC TGC TTA GAA GGC TGC CCT CTG GGG ACA TTT GGT GCT AAC TGC TCC CAG CCA TGC CAG 2395 

FIG.13C 
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C GPGE K CH PE TGACVCP PG H 742 

TGT GGT CCT GGA GAA AAG TGC CAC CCA GAG ACT GGG GCC TGT GTA TGT'CCC CCA GGG CAC 2455 

SGA PCR JG I QEPF T V MPT T P 762 

AGT GGT GCA CCT TGC AGG ATT GGA ATC CAG GAG CCC TTT ACT GTG ATG CCG ACC ACT CCA 2515 

V A Y N S L G A V I G 1 A V L G S L V V 782 

GTA GCG TAT AAC TCG CTG GGT GCA GTG ATT GGC ATT GCA GTG CTG GGG TCC CTT GTG GTA 2575 

AL VAL F IGYRHWQKGKEHHH 802 

GCC CTG GTG GCA CTG TTC ATT GGC TAT CGG CAC TGG CAA AAA GGC AAG GAG CAC CAC CAC 2635 

LAVAYSSGRLDGSEYVMPDV 822 

CTG GCT GTG GCT TAC AGC AGC GGG CGC CTG GAC GGC TCC GAG TAT GTC ATG CCA GAT GTC 2695 

PPSY SHYYSNPSY HTLSQCS .842 

CCT CCG AGC TAC AGT CAC TAC TAC TCC AAC CCC AGC TAC CAC ACC CTG TCG CAG TGC TCC 2755 

PNPP PPNKVPGPLFASLQNP 862 

CCA AAC CCC CCA CCC CCT AAC AAG GTT CCA GGC CCG CTC TTT GCC AGC CTG CAG AAC CCT 2815 

ERP GG A QG HDN HTTLPADWK 882 

GAG CGG CCA GGT GGG GCC CAA GGG CAT GAT AAC CAC ACC ACC CTG CCT GCT GAC TGG AAG 2875 

HRR EPPPGPLDRG SS RLDRS 902 

CAC CGC CGG GAG CCC CCT CCA GGG CCT CTG GAC AGG GGG AGC AGC CGC CTG GAC CGA AGC 2935 

YS YSYS NGPGP FYDKGL ISE 922 

TAC AGC TAT AGC TAC AGC AAT GGC CCA GGC CCA TTC TAC GAT AAA GGG CTC ATC TCT GAA 2995 

EELGASVASLSSENPYAT1R 942 

GAG GAG CTC GGG GCC AGT GTG GCT TCC CTG AGC AGT GAG AAC CCA TAT GCC ACC ATC CGG 3055 

D L PS L PGG PRE SSYME MKG P 962 

GAC CTG CCC AGC TTG CCA GGG GGC CCC CGG GAG AGC AGC TAC ATG GAG ATG AAA GGC CCT 31 15 

PSGSAPRQPPQFWDSQRRRQ 982 

CCC TCA GGA TCT GCC CCC AGG CAG CCT CCT CAG TTT TGG GAC AGC CAG AGG CGG CGG CAA 3175 

FIG.13D 
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P QPQRD S6 TYEQP S PL IHD R 1002 

CCC CAG CCA CAG AGA GAC AGT GGC ACC TAC GAG CAG CCC AGC CCC CTG ATC CAT GAC CGA 3235 

DSVGSQPPL PPGLPPGHYDS 1022 

GAC TCT GTG GGC TCC CAG CCC CCT CTG CCT CCG GGC CTA CCC CCC GGC CAC TAT GAC TCA 3295 

PKNSHIPGH YDLPPVRHPPS- 1042 

CCC AAG AAC AGC CAC ATC CCT GGA CAT TAT GAC TTG CCT CCA GTA CGG CAT CCC CCA TCA 3355 

PPL R R Q D R * 1051 

CCT CCA CTT CGA CGC CAG GAC CGT TGA 3382 

GGAGCCAGGATGGTATGGCAGAGGCCAGCACACCTGGCTGTTGCTGCTCAAGGCTGGGGACAGAGCCTAGTGTACCCCT 3461 

GCCAGGAGCAGGGAGTGGACCGGCAGGCTGTGAACATGAACAACGCTTAACAGAGCAAGTGATGGGAGCCTTGTTCCTG 3540 

GGTTCTACCATGGGAGACGCTGATCAGCAGGATGCCTGGCTCCCTTTCCCAACCCACTGCTCCCAAGGCCTCCAGGGCC 3619 

CTGTGTACATAAACTGGTGGGTTGGAAGTTGCTGGGTAACTCTGATTTCAGACATGCGTGTGGGGTACCTTTTCTGTGC 3698 

ATGCTCAGCCTGGGCTCTGTGCGTGTGTGTGTTTCTGTGATTTTAGAAGGGTACCAGGCAGGTTCTGTCCTAGGGCACT 3777 

TACCATTTAGTAGGGAGATGGAACCAACCCAATTAACTCTAGCAATAGCCTCCTMCTGGCCTCCTCCATTGATTCAGT 3856 

GAACCTTCCAATGCATGGCTCATAATTTCAAAATACAGGCTGGTTAGTTACTCCCTACCTGAAAGCCTTCATAGGTGCC 3935 

TCTTTGCTCTTCTGCCAGTATCAAAACTTTTGAAGGCCTTAAAGGCCCTGCTTTGCCTGGCCCATCTGTCTCTCCAGCC 40 1 4 

TCACCTTGAACTGTGTTCCTGTCACTGCACGCCAGTCACACCGGCCTCTAGGTCCTCCTGTAGGCCACTCTTCTTTCTG 4093 

GCACAGGGACCTGCACACCTGGAGTGCCCTTCCTCCCCCACTCGCCTGTTCACCCCTGCTTTTCCTTTACACCTCCTCC 4172 

TCAGGGAAGTGCCCACCCTCCGTACATCTTTCACAGCCCTGATTGCAGCTGTGTTCACTCACCAGGTACCTGCAGAAGG 4251 

CCTACAGGGTGCCAGGCACTTCTTTAATGGGTTCTTTCTTTATGTGATTATTTGATTAATCTCTGCCTCCCCCACTAGA 4330 

CTGTAAGCTCCCTGAAGGCAAGAATCCTGTGCTTATGCTCAATATTAGCTCTCCCTTGGCACAGAGTAGGCACTCAACA 4409 

AATGCTCCCCAAAAGGCTGAGTGGCTGACTGAATTAAGTACCAGTGACATGCAGTAACTGCTAAGATAGATGAGCCATC 4488 

FIG.13E 



SUBSTITUTE SHEET (RULE 26) 

•.tBNSDOCID: <WO 0100673A1_I8> ih«ww„h „vajw. 



24/95 



TGTATGCTCTGACAGTTACAGACTGAATAAGTTGGAGACTTCCCTAAAGGGTGGCATTTCCCCAGGGTAACAACGCAGA 4567 

GCTCAGGTGTGGGAAGGTGCCAGGGGCAGGGGTGCAGAGGGGCTGAGGCTGAGGGGGGTGCAGAGGCTGGAGAAAGGAT 4646 

AACAGGAGAGAGTATACAGGCATGCCTTGATTTATTGCACTTCACAGGTAGCAGAATTTTTAAAGAAATTGAAGGTTTT 4725 

GGGACATATATGTGACAGCAATAGG7TAAGAAAAGCAAAGCAGAGAAATTGAAGATTTGTGTCAACACTGCTTTAAGCA 4804 s 

AATCTGTTGGCACCATTTTTCCAATAGCATGTGCCCATTTTGGGTCTCTACATTGCATTTTGGTAATTGCTTGCAATAT 4883 

TTCAAGCATTTTCATTGTTATTATATGTGTTATAGTGATCTGTGATCAGTGATCTTTGATATATTATTGTAATTGTTTC 4962 

GGGGCGCCATGAACCGCACCCATATMCACGGTAAACTTAATCAGCAAAAAAAAAAAAAAAAAAAGGGCGGCCG 5036 

FIG.13F 
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EGF: domain 1 of 14, from 151 to 181: score 14.0, E = 1.2 

*->Capnn . . pCsngG tCvn t pggssdn f ggy ICeCppGdyy I sy tGkrC 
C p++ + C + G+Cv +C+C pG + G++C 
hT272 151 CVPLCoqECVH-GRCVAPN QCQCVPG WRGDDC 181 



EGF: domain 2 of 14, from 200 to 229: score -2.2, E = 36 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C+ + C++ + C + g C+Cp tG+ C 

hT272 200 CQFRCQCHG-APCDPQTG — ACFCPAE RTGPSC 229 



EGF: domain 3 of 14, from 242 to 272: score 16.0, E = 0.81 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C+++ pC+ngG+ + g +C CppG + G C 
hT272 242 CPSTHPCQNGGVFQTPQG — SCSCPPG WMGTIC 272 



EGF: domain 4 of 14, from 285 to 315: score 27.0, E = 0.00045 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C++++ C+ngG C g +C+C+pG ytG+rC 

hT272 285 CSQECRCHNGGLCDRFTG— -QCRCAPG— — YTGDRC 315 



FIG.15A 
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EGF: domain 5 of 14, from 328 to 358: score 18.0, E = 0.22 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
Co+++ C +++C + g C C -K5 +tG+rC 
hT272 328 CAETCDCAPDARCFPANG ACLCEHG FTGDRC 358 



EGF: domain 6 of 14. from 378 to 404: score 7.4, E = 4.9 

*->CopnnpCsngGtCvntpggssdnfggytCeCppGdyylsylGkrC<- 
C+ + + C++ g +C C pG ++G +C 
hT272 378 CDRE — HSLSCHPMNG ECSCLPG WAGLHC 404 



EGF: domain 7 of 14, from 417 to 447: score 29.2, E == 9.3e-05 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C++++ C++gG+C+ t g C+C+pG ytG++C 
HT272 417 CQEHCLCLHGGVCQATSG LCQCAPG YTGPHC 447 



EGF: domain 8 of 14, from 460 to 490: score 6.0, E = 6.5 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C+ + C n C + g +C+C4+G ++ +C 
hT272 460 CSARCSCENAIACSPIDG- ECVCKEG WQRGNC 490 

FIG.15B 
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EGF: domain 9 of 14, from 503 to 533: score 15.9, E = 0.82 

*->CapnnpCsngG tCvn t pggssdn f ggy tCeCppGdyy I sy tGkrC<- 
C+ + C + ++C + g " C+C+pG ++G +C 
hT272 503 CN ASCQC AHE AVCSPQTG ACTCTPG WHGAHC 533 



OSL: domain 1 of 1, from 518 to 576: score -20.5, £ = 6.8 

*->Wstdkh iggr tslGfnl eyr irvtCdenYYGegCnkFCrPrdDaf gH 
+t+ + + + ++ C + 4GegC+ C+ H 
hT272 518 -QTGACTCTPG— -WHGAHCGLPCPKGQFGEGCASRCDCD H 554 



y t . Cd . enGn k I C I eGWkGey C<-* 
+ +Cd+ -KM- +C +GW+G C 
hT272 555 SDgCDpVHGRCQCQAGYMGARC 576 



EGF: domoin 10 of 14, from 546 to 576: score 11.7, E = 2 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
CQ+ + C++ C -H+g +C+C+G f G rC 
hT272 546 CASRCDCDHSDGCDPVHG RCQCQAG WMGARC 576 



EGF: domoin 11 of 14, from 589 to 619: score 17.9, E = 0.24 

*->CopnnpCsng6tCvn tpggssdn f ggy tCeCppGdyy i sytGkrC<- 
C+ ++ C+ngGtC++ g C+C+pG + G+ C 
hT272 589 CSNTCTCKNGGTCLPENG — NCVCAPG FRGPSC 619 

FIG.15C 
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EGF: domain 12 of 14, from 632 to 661: score 18.0, E - 0.23 

*->CapnnpCsngG tCvn t pggssdn f ggy ICeCppGdyy i sy tGkrC<- 
C p C n+ +C+++ g tC C G +tG-HC 
hT272 632 CVPC-KCANHSFCHPSNG TCYCLAG WTGPDC 661 



EGF: domain 13 of 14, from 674 to 704: score 27.1, E = 0.00042 

*->CopnnpCsngG tCvn t pggssdn f ggy tCeCppGdyy I sy tGkrC<- 
Ca+++ C++gGtC++ g +C+Cp G +tG++C 
hT272 674 CAQTCQCHHGGTCHPQDG — SCICPLG WTGHHC 704 



EGF: domain 14 of 14, from 717 to 747: score 1.7, E = 16 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C++++ C g +C++ g C+CppG +G C 
hT272 717 CSQPCQCGPGEKCHPETG ACVCPPG HSGAPC 747 

FIG.15D 
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Input file t272Atmzb62c4; Output File t272Atmzb62c4. pot 
Sequence length 2569 

STHA SGDPVHGQCR CQ A GW 19 

G TCG ACC CAC GCG TCC GGT GAC CCT GTT CAT GGA CAG TGC CGA TGT CAG GCT GGT TGG 58 

MGTRCHLPCPE GFWGANCSN 39 

ATG GGG ACA CGC TGC CAC CTG CCT TGC CCG GAG GGC TTT TGG GGA GCC AAC TGC AGT AAC 118 

T C T C K.N G G T C V S E N G N C V C A 59 

ACC TGT ACC TGC AAG AAT GGT GGT ACC TGT GTG TCT GAG AAT GGC AAC TGC GTG TGC GCA 178 

P G F R G P S C Q R P C P P G R Y G K R 79 

CCA GGG TTC CGA GGC CCC TCC TGC CAG AGG CCC TGC CCG CCT GGT CGC TAT GGC AAA CGC 238 

C V Q C K C N N N H S S C H P S D G T C 99 

TGT GTG CAA TGC AAG TGT AAC AAC AAC CAT TCT TCC TGC CAC CCA TCG GAC GGG ACC TGC 298 

SC L A GW TG PDCSE ACP P G HW 119 

TCC TGC CTG GCG GGC TGG ACA GGC CCT GAC TGC TCC GAG GCA TGT CCC CCA GGC CAC TGG 358 

GL KCSQL CQC HHG GTCHPQD 139 

GGA CTC AAA TGC TCC CAA CTC TGC CAG TGT CAT CAT GGT GGG ACC TGC CAC CCC CAG GAT 418 

GSC1CTPG WTGPN CLEGCPP 159 

GGG AGC TGT ATC TGC ACG CCA GGC TGG ACT GGA CCC AAC TGC TTG GAA GGC TGC CCA CCA 478 

RMF GV NC SQL C QCO LG EMCH 179 

AGA ATG TTT GGT GTC AAC TGC TCC CAG CTA TGT CAG TGT GAT CTC GGA GAG ATG TGC CAC 538 

PETGACVCP PGHSGADCKMG 199 

CCA GAG ACT GGG GCT TGT GTC TGT CCC CCA GGA CAC AGT GGT GCA GAC TGC AAA ATG GGA 598 

SQESFT IMPTSPVTHNSLGA 219 

AGC CAG GAG TCC TTC ACC ATA ATG CCC ACC TCT CCC GTG ACC CAT AAC TCA CTG GGT GCA 658 

VIGIAVLGTLVVAL1ALF1G 239 

GTG ATT GGC ATT GCA GTA CTG GGA ACC CTC GTG GTG GCC CTG ATA GCA CTG TTC ATT GGC 718 

FIG.16A 
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YR Q W Q KG KE H E HL AVAYST G 259 

TAC CGC CAG TGG CAA AAG GGC AAG GAA CAT GAG CAC TTG GCA GTG GCT TAG AGC ACT GGG 778 

RLDGSDYVMPDVSPSYSHYY 279 

CGG CTG GAT GGC TCT GAT TAC GTC ATG CCA GAT GTC TCT CCG AGC TAT AGT CAC TAC TAC 838 

S NPS YHT LSQCSPNPPPPNK 299 

TCC AAC CCC AGC TAC CAC ACA CTG TCT CAG TGT TCT CCT AAC CCC CCG CCC CCT AAC AAG 898 

V P G S Q L F V S S Q A P E RP S RA H 319 

GTC CCA GGC AGT CAG CTC TTT GTC AGC TCT CAG GCC CCT GAG CGG CCA AGC AGA GCC CAC 958 

G R E NH T T L PA DWK H R REP H D 339 

GGG CGT GAG AAC CAT ACC ACA CTG CCC GCT GAC TGG AAG CAC CGC CGG GAG CCC CAT GAC 1018 

RGASHLDRSYSCSYSHRN GP 359 

AGA GGC GCC AGC CAC CTG GAC CGA AGC TAT AGC TGT AGC TAT AGC CAC AGG AAT GGC CCA 1078 

GPFCHKGP I SEEGLGASVMS 379 

GGA CCA TTC TGT CAT AAA GGT CCC ATC TCT GAA GAG GGA CTA GGG GCA AGC GTT ATG TCC 1138 

LSSENPYAT I R D LPS LPGE P 399 

CTG AGC AGT GAG AAC CCC TAT GCT ACC ATC CGA GAC CTG CCC AGC CTG CCT GGG GAA CCC 1 198 

R ESGYVEMKGPPSVSPPR QS 419 

CGA GAA AGT GGC TAT GTG GAG ATG AAA GGA CCT CCA TCA GTG TCC CCT CCC AGG CAG TCT 1258 

L H L RD RQQRQ L QPQ RD SG T Y 439 

CTT CAT CTC CGG GAC AGG CAG CAG CGG CAA CTG CAG CCA CAG AGG GAC AGC GGC ACC TAT 1318 

EQPSP L SHNEESLGST PPL P 459 

GAG CAG CCC AGC CCC TTG AGC CAT AAT GAA GAG TCT TTG GGC TCC ACG CCC CCG CTT CCT. 1378 

PGLPP GHYDSP KNSHIPGHY 479 

CCA GGC CTG CCT CCT GGT CAC TAC GAC TCC CCC AAG AAC AGC CAT ATC CCT GGA CAC TAT 1438 

DLPPVRHPP SPPSRR QDR* 498 

GAC TTG CCT CCA GTA CGG CAT CCT CCA TCC CCT CCA TCC CGG CGC CAG GAC CGC TGA 1495 

FIG.16B 
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AGAGCCGGCATGGTATGGGAGCGTGCCTATGTACCTTGCCAGGAGCAGGGACTGGACCAGCAGGCCACGAACAGAAACA 1574 

CTTGGTGAAGTGAACAGAGACGGACTGTGGCCCTGTGCTTCCACCGAGGGAGACACTAGTTGACAAAGTGTCTAACCCT 1653 

CTTTTCCAACCCACTGCTCAAGTCCCTGTGGACATAAGCTGGTGGGCAGAATGTTGTTGTACAAGTGTGATTTTAGATC 1 732 

GATTTTTTTTTAAAGTATGTGTTGGGTACCTTTTCTGTGTGTATGCTCAGGCAGGCTGTGTGTGTCTCTAGTTGGCTTT 1811 

AGAGGGAGTCAGGTATAGGTTCTGCCTTCTGCACTTTCCATCTTATCTAGTAGTCAGCTTCCAAGCTTAACTAGTTAGA 1890 

GCTCCACCAGCAGCAGGCCCTAACTACCTGCCTGCCCTTCACCCAGTAATCCTCCATGTCTTTGCTCAGAGGATTGCTC 1969 

CCCGACTCTGGTGTTGTCCTCCTGGTACGCCTTGACGGTCCTGCAGTCTCCCTTTCCCGTCTTGCTTCATTCTTTCCCA 2048 

GAATGAAGGCTGTCTGCCACCCTACTTCCCAGCCCAGGAATTGGCACATCTAAGTTCAGCCTTCCTAAGTTACCCGTTG 2127 

AGTCCTGCTTGCCCTTCACATATTCCACAGAACACCCACCCCACATCTGCTTCATAGCTACTCTCTTCTCCACGTACCC 2206 

ACAGAAGGCAGAAGTGGTACCAGGCAAGAAGATGGGATTGTTGCATTTTGTTTTGTTTTTGAGACTCTGTCTCACTATG 2285 

TAGTCCTGGCTGGCCTGGAACTCAAGAGCTCTGCCTGCCTCTGCCTCTTGAGTGCTGGGTTTAACGGCTCAGGGTCACA 2364 

TGCACAGCTCAAGCTGCACTCCGATGTGCTTTCCCCTGTTGCTAGATTAGCGTCTGCCTCCCCCTAGTGGAGAGGCTGA 2443 

TCGCCAGCTCTCTGATGCAGGACTCTGGTGTTTAGGCTCACTCACTATTGGTTTCCTTGGCACAGGGTAGTCACTCAAT 2522 

AAATGTTCCTCTAAAAGCTGAAAAAAAAAAAAAAAAAGGGCGGCCGC 2569 

FIG.16C 
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Input file T295Athyb23d9; Output File T295Athyb23d9.pat 
Sequence length 1497 

GTCGACCCACGCGTCCGGCTCCCAGCCGACCCCCAAACAGACACAGCGTAGCCCGGGCCAGCTCTTAAGGA6TTCAGGA 79 

GTGAGAAGAGGCCCTCAGAGATCTGACAGCCTAGGAGTGCGTGGACACCACCTCAGCCCACTGAGCAGGAGTCACAGCA 158 

M A P A R 5 

CGAAGACCAAGCGCAAAGCGACCCCTGCCCTCCATCCTGACTGCTCCTCCTAAGAGAG ATG GCA CCG GCC AGA 231 

AG F CP L L L L L L LG LWVAE I P 25 

GCA GGA TTC TGC CCC CTT CTG CTG CTT CTG CTG CTG GGG CTG TGG GTG GCA GAG ATC CCA 291 

VSAK PKGMT SSQWFKIQHMQ 45 

GTC AGT GCC AAG CCC AAG GGC ATG ACC TCA TCA CAG TGG TTT AAA ATT CAG CAC ATG CAG 351 

P SPQ ACN'SAMKNl NKHTKRC 65 

CCC AGC CCT CAA GCA TGC AAC TCA GCC ATG AAA AAC ATT AAC AAG CAC ACA AAA CGG TGC 411 

KDLN TFLHEPFSSVAATCQ T 85 

AAA GAC CTC AAC ACC TTC CTG CAC GAG CCT TTC TCC AGT GTG GCC GCC ACC TGC CAG ACC 471 

PK IACKNGDKNCHQSHG PVS105 

CCC AAA ATA GCC TGC AAG AAT GGC GAT AAA AAC TGC CAC CAG AGC CAC GGG CCC GTG TCC 531 

LT MCKLT SGKYPNCRYKE K R125 

CTG ACC ATG TGT AAG CTC ACC TCA GGG AAG TAT CCG AAC TGC AGG TAC AAA GAG AAG CGA 591 



QNKSY V VAC KPPQKKDSQQF 145 
CAG AAC AAG TCT TAC GTA GTG GCC TGT AAG CCT CCC CAG AAA AAG GAC TCT CAG CAA TTC 651 



HLVPV HLDRVL* 
CAC CTG GTT CCT GTA CAC TTG GAC AGA GTC CTT TAG 



157 
687 



FIG.18A 
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GTTTCCAGACTGGCTTGCTCTTTGGCTGACCTTCAATTCCCTCTCCAGGACTCCGCACCACTCCCCTACACCCAGAGCA 766 

TTCTCTTCCCCTCATCTCTTGGGGCTGTTCCTGGTTCAGCCTCTGCTGGGAGGCTGAAGCTGACACTCTGGTGAGCTGA 845 

GCTCT AGAGGG ATGGCT T TTC ATCT TTTTG TTGCTG T T T TCCCAGATGCTTAT CCCCAAG AAACAGCAAGCTCAGG TCT 924 

GTGGGTTCCCTGGTCTATGCCATTGCACATGTCTCCCCTGCCCCCTGGCATTAGGGCAGCATGACAAGGAGAGGAAATA 1003 

AATGG A A AGGGGGCAT ATGGG AT T TGTGG ACACAGC TG T T TCTGT TCCTGAAC T AG AAG TCTTCCCCAGCTCTG ACG TG 1082 

GCAGTGAGGTGACCTGAAGGAAAGAAAAATATAAATAAATACCACTTCATATTTGTATAGAATCCTCTAATCCCTTGTG 1161 

ACATAGACTTGACAGGGATTGTATGCCTTCTTTATGGATGAGGAAATTAAGGTTTTAGAAAGCTTAATGAATTAAAGAG 1240 

CTTGTCTAATTAGTTAGTAGCAGAACCTGGACTTGAACCTAGGTCTCCTTGCTCTAAATACAGTGTACCTTCTACTCTA 1319 

CCAGTTGCGCAAGAAAGAAGTCACTGTTACAGAGGCAAGCGGTGAACTAGGTAAGAGTTCACTCATGAAGAAACGAGTG 1 398 

CTCTGAAGAGCCAGTTACCCTGTGTTGGCTGCAATMAGGTCATTACCTCTCTAGCCAAAAAAAAAAAAAAAAAAAAAA 1 477 

AAAAAAAAAAAAAAAAAAAA 1497 

FIG.18B 
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TM 
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Ngly 1 ' i i 1 1 1 i iv i* 
out 



2.0' 



I . | . | . | . | . | i | , | > 
1 41 81 121 



FIG.19 



BNSDOCID: <WO^_0100673A1JB> 



SUBSTITUTE SHEET (RULE 26) 



37/95 



*->qesrAqkFlrQHiDspktsssripnYCNqMMdkrRnmtqgrCKpvNTF 
+ ++ q+F++QH+ ++s + CN +M k++n rCK+ NTF 
32 GMTSSQWFKIQHM- - -QPSPQA- - -CNSAM-KNINKHTKRCKDLNTF 71 



vHesl adVkaVCsqkNvtCkNGqkNCyqSkssfqiTdCrl tggsqkyPnC 
+He++++V a C ++ + CkNG kNC+qS+ ++++T C+lt+g yPnC 
72 LHEPFSSVMTCQTPKIACKNGDKNCHQSHGPVSLTMCKLTSGK- -YPNC 119 



rYrtsastkhl i VACEgrd . rddPyynPyvPVHFDasv< - * 
rY+ + ++k ++VAC 1 1 i -H -d+ ++ vPVH+D++ 
120 RYKEKRQNKSYVVACKPPQkKDSQQFH - LVPVHLDRVL 156 

FIG. 20 
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Input (ile T354Athla42a4; Output File T354Ath!a42a4.pat 
Sequence length 1788 

M P L L 4 

GTCGACCCACGGCGTCCGGCCAGGCTCCACTGAGGGGAACGGGGACCTGTCTGAAGAGAAG ATG CCC CTG CTG 73 

TLYLL LFWLSGYSI ATQI TG 24 

ACA CTC TAC CTG CTC CTC TTC TGG CTC TCA GGC TAC TCC ATT GCC ACT CAA ATC ACC GGT 133 

P T TVN GLERGSLTVQCVYRS 44 

CCA ACA ACA GTG AAT GGC TTG GAG CGG GGC TCC TTG ACC GTG CAG TGT GTT TAC AGA TCA 193 

GWE TYL KW W CR GA I WRDCK I 64 

GGC TGG GAG ACC TAC TTG AAG TGG TGG TGT CGA GGA GCT ATT TGG CGT GAC TGC AAG ATC 253 

L V K T S G S E Q E V K R D R V S 1 K D 84 

CTT GTT AAA ACC AGT GGG TCA GAG CAG GAG GTG AAG AGG GAC CGG GTG TCC ATC AAG GAC 313 

NQ KNR T F TV TMEDtMKTDAD 104 

AAT CAG AAA AAC CGC ACG TTC ACT GTG ACC ATG GAG GAT CTC ATG AAA ACT GAT GCT GAC 373 

T Y W C G I E K T G N D L G V T V Q V T 124 

ACT TAC TGG TGT GGA ATT GAG AAA ACT GGA AAT GAC CTT GGG GTC ACA GTT CAA GTG ACC 433 

I D P A S T P A P T T P T S T T F T A P 144 

ATT GAC CCA GCG TCG ACT CCT GCC CCC ACC ACG CCT ACT TCC ACT ACG TTT ACA GCA CCA 493 

VT QE E T SSSP T L TG HH L DN R 164 

GTC ACC CAA GAA GAA ACT AGC AGC TCC CCA ACT CTG ACC GGC CAC CAC TTG GAC AAC AGG 553 

H K L L K L S V L L P L I F T I L L L L 184 

CAC AAG CTC CTG AAG CTC AGT GTC CTC CTG CCC CTC ATC TTC ACC ATA TTG CTG CTG CTT 613 

LV AAS L L AWRMM K Y QQ KA A G 204 

TTG GTG GCC GCC TCA CTC TTG GCT TGG AGG ATG ATG AAG TAC CAG CAG AAA GCA GCC GGG 673 

M S P E Q V L Q P L E G D L C Y A 0 L T 224 

ATG TCC CCA GAG CAG GTA CTG CAG CCC CTG GAG GGC GAC CTC TGC TAT GCA GAC CTG ACC 733 
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L Q L AG T S P R KATT KL S SAQV244 

CTG CAG CTG GCC GGA ACC TCC CCG CGA AAG GCT ACC ACG AAG CTT TCC TCT GCC CAG GTT 793 

DQVEVEYVTMASLPKED1 S Y 264 

GAC CAG GTG GAA GTG GAA TAT GTC ACC ATG GCT TCC TTG CCG AAG GAG GAC ATT TCC TAT 853 

ASLT LGAEDQEPTYCNMGHL284 

GCA TCT CTG ACC TTG GGT GCT GAG GAT CAG GAA CCG ACC TAC TGC AAC ATG GGC CAC CTC 913 

SSHLPGRG PEEPTEYSTI S R 304 

AGT AGC CAC CTC CCC GGC AGG GGC CCT GAG GAG CCC ACG GAA TAC AGC ACC ATC AGC AGG 973 

P * 306 

CCT TAG 979 

CCTGCACTCCAGGCTCCTTCTTGGACCCCAGGCTGTGAGCACACTCCTGCCTCATCGACCGTCTGCCCCCTGCTCCCCT 1058 

CATCAGGACCAACCCGGGGACTGGTGCCTCTGCCTGATCAGCCAGCATTGCCCCTAGCTCTGGGTTGGGCTTGGGGCCA 1 1 37 

AGTCTCAGGGGGCTTCTAGGAGTTGGGGTTTTCTAAACGTCCCCTCCTCTCCTACATAGTTGAGGAGGGGGCTAGGGAT 1216 

ATGCTCTGGGGCTTTCATGGGAATGATGAAGATGATAATGAGAAAAATGTTATCATTATTATCATGAAGTACCATTATC 1295 

ATAATACAATGAACCTTTATTTATTGCCTACCACATGTTATGGGCTGAATAATGGCCCCCAAAGATATCTGTGTCCTAA 1374 

TCCTCAGMCTTGTGACTGTTACCTTCTGTGGCAGAAAGGGACAGTGCAGATGTATGTAAGTTAAGGACTTTGAGATAG 1453 

AGAGGTTATTCTTGCTGATTCAGGTGGGCCCAAAATATCACCACAAGGGTCCTCATAAGAAAGAGGCGAGAAGGTCAAA 1532 

GAGGTAGAGACAAAGTGATGATGGAAGTGGACGTGGGTGTGACGTGAGCAGGGGCCATGAATGCCGCAGCCTTCAGATG 1611 

CCAGAAAGGGAAAGGAATGGATTCCCCTGCCTGGAGCCTCCAAAAGAAACCAGCCCTGCCCACGCCTTGACTTGAGCCC 1690 

ATTGAAACTGATCTTGAGCTCCTGGCCTCCAGAATTGCAGGAGAATAAATTTGTGTTGTTTTTAAAAAAAAAAAAAAAA 1769 

AAAAAGGGCGGCCGCTAGA 1788 
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T354 



Cys 

Ngly 

out 

TM 
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1 41 81 121 161 201 241 281 
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*->Gesvtl_tCsvsgfgppgvsvtWyf kngk. Igpsl 1 gysysrl 

-H-s+t +C ++ + + +++ W+ ++ ++ k 1 ++ s + 
33 RGSLTVQCVYR- -SGWETYLKWWCrgai wRDCKi LVK- -TSGSEQEV 75 

esgekanl segrf si s. . si tLti ssvekeDsGtYtCvv<-* 

++ r+si 1 1 1 1 1 1 1 1 t+t+ ++ k D+ tY+C 

76 KRD • - - RVSIKdnqknrTFTVTMEDLMKTDADTYWCGI 110 
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Input file T378Athta28f4; Output File T378Athta28f4.pat 
Sequence length 3258 



MDHCGALFL 9 

CACGCGTCCGGCCAGTTCTTGGAGGAGACTCTGCACAGGGC ATG GAT CAC TGT GGT GCC CTT TTC CTG 68 

C L C L L T L Q N A T T E T W E E L L S 29 

TGC CTG TGC CTT CTG ACT TTG CAG AAT GCA ACA ACA GAG ACA TGG GAA GAA CTC CTG AGC 128 

Y M ENMQV SRG RSSVFSSRQL 49 

TAC ATG GAG AAT ATG CAG GTG TCC AGG GGC CGG AGC TCA GTT TTT TCC TCT CGT CAA CTC 188 

HQ L EQM L L NT S F P GYN L T L Q 69 

CAC CAG CTG GAG CAG ATG CTA CTG AAC ACC AGC TTC CCA GGC TAC AAC CTG ACC TTG CAG 248 

T P T I QSLA FKLSCOFSG LSL 89 

ACA CCC ACC ATC CAG TCT CTG GCC TTC AAG CTG AGC TGT GAC TTC TCT GGC CTC TCG CTG 308 

T S A T L K RV P QAGGQHA RG Q H 109 

ACC AGT GCC ACT CTG AAG CGG GTG CCC CAG GCA GGA GGT CAG CAT GCC CGG GGT CAG CAC 368 

AMQ FPAEL TRDACK TRPRE L 129 

GCC ATG CAG TTC CCC GCC GAG CTG ACC CGG GAC GCC TGC AAG ACC CGC CCC AGG GAG CTG 428 

RL I CIYFSNTHFF KDENN S S 149 

CGG CTC ATC TGT ATC TAC TTC TCC AAC ACC CAC TTT TTC AAG GAT GAA AAC AAC TCA TCT 488 

L L NNYV LGAQL SHG HVNNL R 169 

CTG CTG AAT AAC TAC GTC CTG GGG GCC CAG CTG AGT CAT GGG CAC GTG AAC AAC CTC AGG 548 

DP VN 1 SFWH NQSL EG Y T L T C 189 

GAT CCT GTG AAC ATC AGC TTC TGG CAC AAC CAA AGC CTG GAA GGC TAC ACC CTG ACC TGT 608 

VF WKE G AR KQPWGGW S PEG C 209 

GTC TTC TGG AAG GAG GGA GCC AGG AAA CAG CCC TGG GGG GGC TGG AGC CCT GAG GGC TGT 668 

RT EQPSHSQ VL CRC N H LTYF 229 

CGT ACA GAG CAG CCC TCC CAC TCT CAG GTG CTC TGC CGC TGC AAC CAC CTC ACC TAC TTT 728 
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AVLMQL SPAL V PAE LL APL T 249 

GCT GTT CTC ATG CAA CTC TCC CCA GCC CTG GTC CCT GCA GAG TTG CTG GCA CCT CTT ACG 788 

YISLVGCSISIVASLI TVLL 269 

TAC ATC TCC CTC GTG GGC TGC AGC ATC TCC ATC GTG GCC TCG CTG ATC ACA GTC CTG CTG 848 

HF HFRKQSDSLTR I HMNLHA 289 

CAC TTC CAT TTC AGG AAG CAG AGT GAC TCC TTA ACA CGC ATC CAC ATG AAC CTG CAT GCC 908 

S V L L LN 1AFL LSPAFAMSP V 309 

TCC GTG CTG CTC CTG AAC ATC GCC TTC CTG CTG AGC CCC GCA TTC GCA ATG TCT CCT GTG 968 

P G S A C T A L A A A L H Y A L L S C L 329 

CCC GGG TCA GCA TGC ACG GCT CTG GCC GCT GCC CTG CAC TAC GCG CTG CTC AGC TGC CTC 1028 

TWMA I E GF NLYLL LGRVYN I 349 

ACC TGG ATG GCC ATC GAG GGC TTC AAC CTC TAC CTC CTC CTC GGG CGT GTC TAC AAC ATC 1088 

YI RR YVF KLGVLGWGAPALL 369 

TAC ATC CGC AGA TAT GTG TTC AAG CTT GGT GTG CTA GGC TGG GGG GCC CCA GCC CTC CTG 1148 

VL.LSL SVKSSVYGPCTIPVF 389 

GTG CTG CTT TCC CTC TCT GTC AAG AGC TCG GTA TAC GGA CCC TGC ACA ATC CCC GTC TTC 1208 

D SWENG TGFQ NMS I CWVR SP. 409 

GAC AGC TGG GAG AAT GGC ACA GGC TTC CAG AAC ATG TCC ATA TGC TGG GTG CGG AGC CCC 1268 

VVHSVLVM6YGGL TSL FNLV 429 

GTG GTG CAC AGT GTC CTG GTC ATG GGC TAC GGC GGC CTC ACG TCC CTC TTC AAC CTG GTG 1328 

VL AWALWTLRRLRERADAPS 449 

GTG CTG GCC TGG GCG CTG TGG ACC CTG CGC AGG CTG CGG GAG CGG GCG GAT GCA CCA AGT 1388 

VRACHDTVTVLGL TVL LG T T 469 

GTC AGG GCC TGC CAT GAC ACT GTC ACT GTG CTG GGC CTC ACC GTG CTG CTG GGA ACC ACC 1448 

WALAFFSFGVFLLPQLF LFT 489 

TGG GCC TTG GCC TTC TTT TCT TTT GGC GTC TTC CTG CTG CCC CAG CTG TTC CTC TTC ACC 1508 

FIG.24B 
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ILNSLYGFFLFLWFCSQRCR 509 

ATC TTA AAC TCG CTC TAC GGT TTC TTC CTT TTC CTG TGG TTC TGC TCC CAG CGG TGC CGC 1568 

S E AE AKAQ I E A F SSS Q T T Q * 529 

TCA GAA GCA GAG GCC AAG GCA CAG ATA GAG GCC TTC AGC TCC TCC CAA ACA ACA CAG TAG 1628 

TCCGGGCCTCCTGGCCTGGAATCCTCAGCCTCTCTGGCCGCCAGTAGCCTGAGGCTACGGCTCCTGCTAGAGAGGGTGG 1707 

CAGGCCTGCTGCTGGACCCCAGAGGCCACTGTGACCGCCAAGGGGCCTTTTCCACTTCCACGGCCTCTCCAGGCACTGA 1786 

GGGGAAGGCATTGCTCTACCTCTCCCTGACATTTTGCTCCGGGGCAGATCCAACCTTACCTGGGGCAGCAAACTTTGTC 1865 

CTGGTACCTGGGCCCAGCTCGCCAGGGATGTGGGCAGAGCACCAGCCTGGGCATCAGGAAGCCAAGTTTCAAGGACTGT 1944 

CTTTGAGTCTGTCTGTATGACCTTGGGCCTGCCACTTCTCACAGACCCTAGGTATCCACAGCTGTGACATGGGGGCAAG 2023 

CGGCTTTGTTTCAGCCTAACCCAGGAGCTTAGTAAAAATTGCATAAGACCAGGGGGAAGAGTGTCAGCGTGGGGTGGGA 2102 

ATTCCCGCGGCCTCCACCTGCTTGCTAGGGGCAGGATCTCATTCAGGCTGCCCTGGAAGCACCTGCTTGGCCCTGCCAC 2181 

CTTCCTCCAGGGGAGGGCCAGATGGCATCCTGGCTTGGGGCGGGTGGGACCTACCCAGGCTCTGAGACTTTACTGGCCT 2260 

ATGCCTGAGGCCTCTTTTCCTTTAACTCCCTAAATTATGATGACTCCAAGTCCAAGCCCACCCTTCCCAAAGATTGGGA 2339 

GGTTCCGCCGTTCCCAGAGGCTCCTCCTGCGGTGCTCCCAAGACTTCCATAGACCATCTGGACCAGTAGCCCATCCCGC 2418 

AGTTTTCTTGGGGGCAGAGGAAAACGCTTCTTTCTCCTCCAGCTGAATCAGCTGGATCCCAGTGTCCTGGCTGTTTGGT 2497 

GATTGGGCAAGATTGAATTTGCCCAGGTAGGCGTGAGAGTGTGGGTTTTAAATTCGAAGCTCAGGCCATAGTTTCAGAG 2576 

AATCACCCTTACCCCAGACCTTCATGAGACAGTGCTCATGAAGCCAGTGCGTTTCCCAGAACGAACACTAGGCGGCACC 2655 

GTTGGTCCACACTCAGAGGCCCTTGGCGCCAAGACTGCATCTAGAATCGCTCAAACACCTGTTTGCAGACCCCATGCAC 2734 

CAGCTGGAGGGGCCGTAACTGCAGGACTGCGCCTACTGAGTGACCCATTTCCTCCAGGAGGAAAGGCAAGACACGCTTA 2813 

CACGGCCATTTGTCTCTTTTCCCAATGCGGCGGTGCACTTTCGCTCTTGGGGGCTGCACCCCAGACATAGCTGGCACCA 2892 
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GAGCAGGGTGCTCAGGTGGTGGGTGCTCAGGGCCCTGCCCCAGGCCACTGGGCCGTTTTGATGACCTCGAAGGTCACAG 2971 

GCAGAAAATAGGAGCAGGATTTCCCCTGGGGAMAGTTCTCCTGGGACATCTTCTGCTCTTCTGTACATTTCTAGATGC 3050 

AAATAACTCCTTCACCAGGCAGTGAGTGGCGTAGGCTCTGGAGCCAGGCTGCCTGGGCTCCAATGCCAGCTCTGCCACT 3129 

TGCTAGCTGTGAGACTGTGGACAAACCACTCAGCCTCTGTGTGCCTCAGTTTTCCTATTTGTAAAATAGAGGCCATAGT 3208 

GGTACCTATTTTGAAGACTAAGTAAAAGAATTCAAATAAAGAGACTTGGC 3258 
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*->CnrtWDgitC. .Wpdt ppGel VvvpCPkyfygfssdqtdttgn 

+tC W+ + +++p+G ++ C + +q + + 

187 LTCvf WKEGarkqPWGGWSPEGC RTEQ - - ■ PSH 216 

vsRnCtedGsWsepppsNrtWrnysaCgeddpeeesekkkkyyl vl kiiY 

217 SQVLCRCNH- -LTYFA - VLMQLSPALVPAELLAPLTYIS 252 

tvGYSl SLaaLl vAvvILl 1 FRkLhtl wpdnadgal evgapWGAPf qvr r 

. +vG S+S++a 1+ v++ FRk + + 
253 LVGCSISIVASLITVLLHFHFRKQS DSL : -- 280 

Si rCtRNylHmNLFI SFILrAasvf i kdavl ksevssdeperLssrcsl s 
tR IHmNL +S +L +++ ++ a s v+ ++ 
281 TR- - IHMNLHASVLLLNIAFLLSPAFAMSPVPGSA- ■ ■ - •- 313 

tgqvvvgCkl 1 vvfQfqYcvmtNf fWl LvEGl YLhtLLvvtf f sErkyl w 
C +1 ++ ++Y++++ +W+ +EG L+ LL + ++y + 
314 ...... -CTALAAA- LHYALLSCLTWMAIEGFNLYLLLGRVY- - -NIYIR 352 

wYl . . . . lIGWGvPlVfvtvWaivRnfedtgCWdsnGLAmFPEAKiiiCi W 
Y+ + 4++GWG+P++ v v++ ++ +C++++ F 
353 RYVfkl gVLGWGAPALLVLLSLSVKSSVY-GPCTIPV- - - ■ FDSWENGTG 397 

msdnshl wWI IkgPi LI si 1 V NFflFiniirl LvtKLRaa 

n+++ W+ + P++ s+IV + ++ ++ M i i i i ++ L + LR+ 
398 F - QNMS I CWV ■ RS P V VHSVLVmgyggl t s 1 f N LV VLAWALWTL - RRLRER 444 

qtgetdqrqYsqYrkLaKSTLl LIPLfGIhyvvFaf rPsndarGvl rki k 
+ + + + L L L+G++ + +f+++ v+ + 
445 ADAPSVR ACHDTVTVLGLTVLLGTTWALAFFSFG VFLLPQ 484 

1 yf el sLgSFQGFf VAvl YCF1 NgEVQaEi rrrW< - * 
1++ L+S+ GFf ++ F+ + ++E + 
485 LFLFTILNSLYGFF- - LFLWFCSQRCRSEAEAKA 516 
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10 20 30 40 50 60 70 

inputs ATGACGCCGAGCCCCCTGTTGCTGCTCCTGCTGCCGCC6CTGCTGCTGGGGGCCTTCCCGCCGGCCGCCG 



80 90 100 110 120 130 140 

CCGCCCGAGGCCCCCCAAAGATGGCGGACAAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGCCGCACTGT 

CACG-- 



150 160 170 180 190 200 210 

GCGGCTGCAGTGCCCAGTGGAGGGGGACCCGCCGCCGCTGACCATGTGGACCMGGATGGCCGCACCATC 



220 230 240 250 260 270 280 

Inputs CAGAGCGGCTGGAGCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTGGAGCGGGAGGATG 



290 300 310 320 330 340 350 

inputs CCGGCGTGTACGTGTGCAAGGCCACCAACGGCTTCGGCAGCCTGAGCGTCAACTACACCCTCGTCGTGCT 



360 370 380 390 400 410 420 

inputs GGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTGGGGGTCAAGAGGACCCCGCC 



430 440 450 460 470 480 490 

i nputs AGCCAGCAGTGGGCACGACCGCGCTTCACACAGCCCTCCAAGATGAGGCGCCGGGTGATCGCACGGCCCG 

--• ------CGtCCG. - • 

10 

500 510 520 530 540 550 560 

inputs TGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCGGGCACCCTCGGCCCGACATCACGTGGATGAAGGA 



570 580 590 600 610 620 630 

i nputs CGACCAGGCCTTGACGCGCCCAGAGGCCGCTGAGCCCAGGAAGAAGAAGTGGACACTGAGCCTGAAGAAC 



640 650 660 670 680 690 700 

inputs CTGCGGCCGGAGGACAGCGGCAAATACACCTGCCGCGTGTCGAACCGCGCGGGCGCCATCAACGCCACCT 
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710 720 730 740 750 760 770 

inputs ACMGGTGGATGTGATCCAGCGGACCCGTTCCMGCCCGTGCTCACAGGCACGCACCCCGTGAACACGAC 



780 790 800 810 820 830 840 

inputs GGTGGACTTCGGGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGACGTGAAGCCGGTGATCCAGTGG 



850 860 870 880 890 900 910 

inputs CTGAAGCGCGTGGAGTACGGCGCCGAGGGCCGCCACAACTCCACCATCGATGTGGGCGGCCAGAAGTTTG 



920 930 940 950 960 970 980 

inputs TGGTGCTGCCCACGGGTGACGTGTGGTCGCGGCCCGACGGCTCCTACCTCAATAAGCTGCTCATCACCCG 



------- GCCCACGGGTGATGTGTGGTCACGGCCTGATGGCTCCTACCTCAACAAGCTGCTCATCTCTCG 

20 30 40 50 60 70 

990 1000 1010 1020 1030 1040 1050 
1 nputs TGCCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACCATGGGCTACAGCTTCCGCAGC 

GGCCGGCCAGGAT<^iGCTC 

80 90 100 110 120 130 140 

1060 1070 1080 1090 1100 1110 1120 
inputs GCCTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCCACCTGTGGCCTCCTCGTCCTCGGCCACTA 

GCCTTtCTCACTCT 

150 160 170 180 190 200 210 

1130 1140 1150 1160 1170 1180 1190 
inputs GCCTGCCGTGGCCCGTGGTCATCGGCATCCCAGCCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTG 

GCCtGCCAtGGCCTGtG^^ 

220 230 240 250 260 270 280 

1200 1210 1220 1230 1240 1250 1260 
inputs GCnTGCCAGGCCCAGAAGAAGCCGTGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCG 

GCtiTGCCAGACCMGMGMGCCAtGTGCCCC 

290 300 310 320 330 340 350 

1270 1280 1290 1300 1310 1320 1330 
inputs GGGACGGCCCGCGACCGCAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTG 



GGGACATCCCGAGAACGCAGTGGTGACAAGGACCTGCCCTCATTGGC TG 

360 370 380 390 400 

1340 1350 1360 1370 1380 1390 1400 
inputs TGGGGCTGTGTGAGGAGCATGGGTCTCCGGCAGCCCCCCAGCACTTACTGGGCCCAGGCCCAGTTGCTGG 

tGGGCAtAtGtGAGGAGCAtGGAtcCGCCATGGCCC^ 

410 420 430 440 450 460 470 
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1410 1420 1430 1440 1450 1460 

inputs CCCTAAGTTGTACCCCAMCTCTACACAGACATCCACACACACACACA- -CACACAC- -TCTCACACACA 



CCCCAAGCTGTACCCCMGCTATACACAGATGTGCACACACACACACATACACACACCTGCACTCACACG 
480 490 500 510 520 530 540 

1470 1480 1490 1500 1510 
inputs CTCACACGT-GGAGGGCAAGGT-C- - - - - CACCAGCACATCCACTATCAGTGC - 



CTCTCATGTTGGAGGGCAAGGTTCATCAACACCAGCATGTCCACTATCAGTGCTAAATACAGCGAATCTC 
550 560 570 580 590 600 610 

inputs ----- -- 

CAAGCACTGTGTCC 
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970 980 990 1000 1010 1020 1030 

GT6CTGCCCAC6GGT6ACGTGTGGTCGCGGGCGGACG6CTCCTACCTCAATAAGCTSCTCATCACCCCTG 

GtCCGGCCCACGGGTGA^ 

10 20 30 40 50 60 70 

1040 1050 1060 1070 1080 1090 1100 
CCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACCATGGGCTACAGCTTCCGCAGCGA 

CCCGCCAG&TGATGCT^ 

80 90 100 110 120 130 140 

1110 1120 1130 1140 1150 1160 1170 
CTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCCACCTGTGGCCTCCTCGTCCTCGGGCACTAGC 



CTTCCTCACTGTATTACCAGACCCCAAACCTCCAGGGCCTCCTATGGCTTCTTCATCGTCATCCACAAGC 
150 160 170 180 190 200 210 

1180 1190 1200 1210 1220 1230 1240 
CTGGCGTGGCCCGTGGTCATCGGCATCCCAGCCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGC 



CTGCCATGGCCTGTGGTGATCGGCATCCCAGCTGGTGCTGTCTTCATCCTAGGCACTGTGCTGCTCTGGC 
220 230 240 250 260 270 280 

1250 1260 1270 1280 1290 1300 1310 
TTTGCCAGGCCCAGAAGAAGCCGTGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCGGG 

liTGCCAGACCMG^ 

290 300 310 320 330 340 350 

1320 1330 1340 1350 1360 1370 1380 
GACGGCCCGCGACCGCAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTGTG 

GACATCCCGAGAACGCA^ - - - TCTC 

360 370 380 390 400 

1390 1400 1410 1420 1430 1440 1450 
GGGCTGTGTGAGGAGCAT6GGTCTCCGGCAGCCCCCCAGCACTTACTGGGCCCAGGCCCAGTTGCTGGCC 

GGCAtAtGtGAGGAGCAt^ 

410 420 430 440 450 460 470 

1460 1470 1480 1490 1500 1510 1520 

CTMGTTGTACCCCAAACTCTACACAGACATCCACACACACACACA- - CACACAC - -TCTCACACACACT 

CCMGCtGtACCCCAAGCtAtACACAGATGtGCACAC^ 

480 490 500 510 520 530 540 

1530 1540 1550 1560 1570 1580 

CACACGT - GGAGGGCAAGGT-C CACCAGCACATCCACTATCAGTGCTAGACGGCACCGTATCTGC 

CTCATGiTGGAGGGCMGGTTCATCMCACCAGCATGta 

550 560 570 580 590 600 610 

1590 1600 1610 1620 1630 1640 1650 
AGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGCTGCAGACGAAGGCAG 

AA- • -GCACTGTGT CCTGA- -GGTAGGCAT- • ■ • ■ ttGGGGGCC AAGGCAACAG - -GttGG- -G 

620 630 640 650 660 
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1660 1670 1680 1690 1700 1710 1720 
GGGACCGATGGCGAGGAGGAATGGCCA6CACCCCA6GCAGTCTGT6TGTGAGGCATAGCCCCT6GACACA 

AGAAnGAGAAtAATGGAGGAAG- - -AGTATCTTAGGGtGCtt tAtGGTGGACA- - -CTCACAAACTTG 
670 680 690 700 710 720 

1730 1740 1750 1760 1770 1780 1790 

CACACACAGACACACACACTGCCTGGA-TGCATGTATGCACACACATGCGCGCACACGTGCTCCCTGAAG 

GCCATATAI^TGTATGTACTA^^ - GTGT 

730 740 750 760 ' 770 780 790 

1800 1810 1820 1830 1840 1850 1860 

GCACACGTACGCA - CA - CACGCACATGCACAGATATGCCGCCTGGGCACACAGATAAGCTGCCCAAATGC 

AAACCTGtGCACAACtGCACACACAA - C • Ctl^GAMCCTTCAGI^GGAiTrGf GGTC - TCAC - - TTTGC 
800 810 820 830 840 850 860 

1870 1880 1890 1900 1910 1920 1930 

acgcacacgca - cagagacatgccagmcatacaaggacatg - ctgcctgaacata - - cacacgcacacc 
agtgacatctagc&tcg™^ 

870 880 890 900 910 920 930 

1940 1950 1960 1970 1980 1990 

CATGCGCAGATGTG — - CTGCCTGGACACACACACACACACGGATATGCTGTCTGGACGCACACACGTGC 

CCtGCCfetGtGTTCCtffi 

940 950 960 970 980 990 

2000 2010 2020 2030 2040 2050 2060 
AGATATGGTATCCGGACACACACGTGCACAGATATGCTGCCTGGACACACAGATAATGCTGCCTTGACAC 

C- ■ -CtATCAACCTGACtGGGGTGAGdA GtGCAGCCATGCNtGGAGGitfGAGCCACC- - - - CTC 

1000 1010 1020 1030 1040 1050 

2070 2080 
ACACATGCACGGATATTG 

CC-CTTGCtAGAGAGAAG 
1060 1070 

FIG.28B 
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10 20 30 40 50 60 70 

inputs MTPSPLLLLLLPPLLLGAFPPAAAARGPPKMADKVVPRQVARLGRTVRLQCPVEGDPPPLTMWTKDGRTI 



80 90 100 110 120 130 140 

inputs HSGWSRFRVLPQGLKVKQVEREDAGVYVCKATNGFGSLSVNYTLVVLDDISPGKESLGPDSSSGGQEDPA 



150 160 170 180 190 200 210 

inputs SQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITOMKDDQALTRPEAAEPRKKKWTLSLKN 



220 230 240 250 260 270 280 

inputs LRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVLTGTHPVNTTVDFGGTTSFQCKVRSDVKPVIQW 



290 300 310 320 330 340 350 

i nputs LKRVEYGAEGRHNSTIDVGGQKFVVLPTGDVWSRPDGSYLNK.LLITRARQDDAGMYICLGANTMGYSFRS 



PTGDVWSRPDGSYLNKLLISRARQDDAGMYICLGANTMGYSFRS 

10 20 30 40 

360 370 380 390 400 410 420 

inputs AFLTVLPDPKPPGPPVASSSSATSLPWPVVIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPP 



AFLTVLPDPKPPGPPMASSSSSTSLPWPVVIGIPAGAVFILGTVLLWLCQTKKKPCAPASTLPVPGHRPP 
50 60 70 80 90 100 110 

430 ' 440 450 460 470 480 

i nputs GTARDRSGDKDLPSLAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTD IHTHTHTHSHTH - 



GTSRERSGDKDLPSLA- - • - • - - VGICEEHGSAMAPQHI LASGSTAGPKLYPKLYTDVHTHTHTHTCTHT 
120 130 140 150 160 170 180 

490 500 
inputs SHVEGKVHQHIHYQC 

LSCWRARFINTSMSTISAKYSESPSTVS 
190 200 
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Inputs GT- 



ATGTCACCGCCTCTGTGTCCCCTCCTTCTCCTGGCTGT6GGCCT6C6GCTGGCTGGAACTCTCAACCCCA 
10 20 30 40 50 60 70 



i nputs 



GTGATCCCMTACCTGCAGCTTCTGGGAMGCTTCACTACCACCACCAAGGAGTCCCACTCCCGCCCCTT 
80 90 100 110 120 130 140 



i nputs 



CAGCCTGCTCCCCTCAGAGCCCTGCGAGCGGCCCTGGGAGGGCCCCCATACTTGCCCCAGCCCACAAACT 
150 160 170 180 190 200 210 



i nputs 



CAGAGGAAACTCCTGGCTTCTAGGGATTCATTCTGGATGGTCTGTGTCGGGGCTGGAGTGCAGTGGCGAG 
220 230 240 250 260 270 280 



1 nputs 



ATCGTAGTGCACTGCMCCTCAAACAGGGAATGCGCTTTCTATGCGCCCTCAGCCCAGAGTGTTGAGTGG 
290 300 310 320 330 340 350 



i nputs 



TGCCCCTTCCCTGGCCTCCCCTGGCCACACTGTGGTGGTGAAGACGGACCACCGCCAGCGCCTGCAGTGC 
360 370 380 390 400 410 420 



i nputs 



TGCCATGGCTTCTATGAGAGCAGGGGGTTCTGTGTCCCGCTCTGTGCCCAGGAGTGTGTCCATGGCCGTT 
430 440 450 460 470 480 490 



inputs 



GTGTGGCACCCAATCAGTGCCAATGTGTGCCAGGCTGGCGGGGCGACGACTGTTCCAGTGCCCCGAACTG 
500 510 520 530 540 550 560 



i nputs 



CCTTCAGCCCTGTACCCCTGGCTACTATGGCCCTGCCTGCCAGTTCCGCTGCCAGTGCCATGGGGCACCC 
570 580 590 600 610 620 630 



i nputs 



TGCGATCCCCAGACTGGAGCCTGCTTCTGCCCCGCAGAGAGAACTGGGCCCAGCTGTGACGTGTCCTGTT 
640 650 660 670 680 690 700 
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inputs 

CCCAG6GCACTTCTGGCTTCTTCTGCCCCAGCACCCATCCTTGCCAAMTGGAGGTGTCTTCCAAACCCC 
710 720 730 740 750 760 770 

inputs -- - 

ACAGGGCTCCTGCAGCTGCCCCCCTGGCTGGATGGGCACCATCTGCTCCCTGCCCTGCCCAGAGGGCTTT 
780 790 800 810 820 830 840 

i nputs 

CACGGACCCAACTGCTCCCAGGAATGTGGCTGCCACAACGGCGGCCTCTGTGACCGATTCACTGGGCAGT 
850 860 870 880 890 900 910 

inputs — - 

GCCGCTGCGCTCCGGGTTACACTGGGGATCGGTGCCGGGAGGAGTGCCCGGTGGGCCGCTTTGGGCAGGA 
920 930 940 950 960 970 980 

inputs - - 

CTGTGCTGAGACGTGCGACTGCGCCCCGGACGCCCGTTGCTTCCCGGCCAACGGCGCATGTCTGTGCGAA 
990 1000 1010 1020 1030 1040 1050 

inputs - 

CACGGCTTCACTGGGGACCGCTGGACGGATCGCCTCTGCCCCGACGGCTTCTACGGTCTCAGCTGCCAGG 
1060 1070 1080 1090 1100 1110 1120 

inputs CGACC - 

CCCCCTGCACCTGCGACCGGGAGCACAGCCTCAGCTGCCACCCGATGAACGGGGAGTGCTCCTGCCTGCC 
1130 1140 1150 • 1160 1170 1180 1190 

10 

inputs - — CACGC 

GGGCTGGGCGGGCCTCCACTGCAACGAGAGCTGCCCGCAGGACACGCATGGGCCAGGGTGCCAGGAGCAC 
1200 1210 1220 1230 1240 1250 1260 

inputs — -- - - - 

TGTCTCTGCCTGCACGGTGGCGTCTGCCAGGCTACCAGCGGCCTCTGTCAGTGCGCGCCGGGTTACACGG 
1270 1280 1290 1300 1310 1320 1330 

inputs - ■-- - 

GCCCTCACTGTGCTAGTCTTTGTCCTCCTGACACCTACGGTGTCAACTGTTCTGCACGCTGCTCATGTGA 
1340 1350 1360 1370 1380 1390 1400 
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inputs - - 

AMT6CCATCGCCTGCTCACCCATCGACGGCGAGTGCGTCTGCMGGM 



i nputs 

GT< "I480 "1490 1500 " ' i510 'W 1530 1540 



TCTGTGCCCTGCCCACCCGGMCCTGGGGCnCAGTTGCMTGCCAGCTGCCAGTG^ 



1 nputs 



TCTGCAGCGGCCAAACTGGAGCCTGTACCTGCACCCCTGGGTGGCATGGGGCCCACTGCCAGCTGCCCTG 
1550 1560 1570 1580 1590 1600 1610 

20 

inputs TCCG - ^^^T 

tc£GMGGGGCAGTTTGGAGMGGTTGTGCCAGTCGCTGTGACTGTGACCACTCTGATGGCTGtGACCC^ 
1620 1630 1640 



1650 1660 1670 1680 

30 40 50 60 70 80 90 

inputs GTTCATGGACAGTGCCGATGTCAGGCTGGTTGGATGGGCACACGCTGCCACCTGCCnGCCCGGAGG 

GttCAtGGACGCtGTCAGTGCCAGGCtGGCtGGAtGGGTGCCCGCT 

1690 1700 1710 1720 1730 1740 1750 

100 110 120 130 140 150 160 

inputs TTTGGGGAGCCMCTGCAGTAACACCTGTACCTG 

tAt6GG(^(j^ 

170 180 190 200 210 220 230 

inputs CTGCGTGTGCGCACCAGI^TTCCGAGGCCCCTC 

CtGCGtGtGTGCACCCG&TO 

1830 1840 1850 1860 1870 1880 1890 

inputs CGCTGTGTGCMTGCMCT 

CGCtGtGtGCGCtGCMGTG- - -CGCTMCCACtCCiTCtGCCAC^ 

1900 1910 1920 1930 1940 1950 

310 320 330 340 350 360 370 

i nputs TGGCGGGCTGGACAGGCCCTGACTGCTCCGAGGCATGTCCCCCAGGCCACTGGGGACTCAAATGCTCCCA 

f&£TGGCtMAeAGG^ 

1960 1970 1980 1990 2000 2010 2020 

380 390 400 410 420 430 440 

inputs ACTCTGCCAGTGTCATCATGGTGGGACCTGCCACCCCCAGGATGGGAGCTGTATCTGCACGCCAGG 

GACCtGCCMtGtcACCAt^ 

2030 2040 2050 2060 2070 2080 2090 
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450 460 470 480 490 500 510 

inputs ACTC6ACCCAACTCOT 

ACtGGACACCACtGCttAGMGGCtGCCCtCTGGGGAC^ 

2100 2110 2120 2130 2140 2150 2160 
inputs GTGATCTCGGAGAGATGTG^ 



2170 2180 2190 2200 2210 2220 2230 

590 600 610 620 630 640 650 

inputs CTGCAAAATGGGAAGCCAGGAGTCCTTCACCATAATGCCCACCTCT 

TtGCAGGAtTGGMTddA^ 
2240 2250 2260 2270 2280 2290 2300 

1 nputs GCAGTGATTGGCA^TGCAGTACTGGGMCCCTCGTGGTGGCC^ 



2310 2320 2330 2340 2350 2360 2370 

730 740 750 760 770 780 790 

Inputs AGTGGCAAAAGGGCAAGGAACATGAGCACTTGGCAGTGGCTTACAGCACTGGGCGGCTGGATGGCTCTGA 



2380 2390 2400 2410 2420 2430 2440 

800 810- 820 830 840 850 860 

i nputs TTACGTCATGCCAGATGTCTCTCCGAGCTATAGTCACTACTACTCCMCCCCAGCTACCACACAC^ 

GtATGteAtGCCAGAta 
2450 2460 2470 2480 2490 2500 2510 

870 880 890 900 910 920 930 

inputs CAGTGTTCTCCTMCCCCCCGCCCCCTMCMGGTCCCAGGCAGTCAG 

CAGtGCtCCCCAMCdCCCCAcCCCCf AACAAGGnCCAGGC - - - CCGCtcfttGCCAGCCTGCAGAACC 
2520 2530 2540 2550 2560 2570 2580 

940 950 960 970 980 990 1000 

inputs CTGAGCGGCCMGCAGAGCCCACGGGCGTGAGMCCATACCACACTGCCCGCTGACTGGMGCACCGC^ 

Ct&AGCGGCCAGGTGGG^ 

2590 2600 2610 2620 2630 2640 2650 

1010 1020 1030 1040 1050 1160 

i nputs GGAGCCCCAT- - - - - GACAGAGGCGCCAGCCACCTGGACCGAAGCTATAGCTGTAGCTATAGC 



2660 2670 2680 2690 2700 2710 2720 

1070 1080 1090 1100 1110 1120 1130 

inputs CACAGGMTGGCCCAGGACCATTCTG^ 

MtGGCCCAGGCCCATOtACGAt^ 

2730 2740 2750 2760 2770 2780 
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1140 1150 1160 1170 1180 1190 1200 
inputs TGTCCCTSAGCAGTGAGAACCCCTATGCTACCATCC6AGACCTGCCCA6CCTGCCT6G6GAACCCCGAGA 

CTTCCCTGAGCAGTGAGAACCCATATGCCACCATC^ 
2790 2800 2810 2820 2830 2840 2850 

1210 1220 1230 1240 1250 1260 1270 

inputs AAGTGGCTATGTGGAGATGAAAGGACCTCCATCAGTGTCCCCTCCCAGGCAGTCTCTTCATCTCCGGGAC 

GAGCAGCTACATGGAGATGAMGGCCCTCCCTC 
2860 2870 2880 2890 2900 2910 2920 

1280 1290 1300 1310 1320 1330 1340 
i nputs AGGCAG- - -CAGCGGCAACTGCAGCCACAGAGGGACAGCGGCACCTATGAGCAGCCCAGCCCCTTGAGCC 

AGCCAGAGGCGGCGGCMCCCCAGCCACAGAGAGACAGTGGCACCTAC^GCAGCCCAGCCCCCTGATCC 
2930 2940 2950 2960 2970 2980 2990 

1350 1360 1370 1380 1390 1400 1410 
inputs ATMTGMGAGTCTTTGGGCTCCACGCCCCCGCTTCCTCCAGGCCTGCCTCCTGGTCACTACGACTCCCC 

ATGACCGAGAcfcTGTGGGCTCCCAGCCCCCTCTO 
3000 3010 3020 3030 3040 3050 3060 

1420 1430 1440 1450 1460 1470 1480 
inputs CAAGAACAGCCATATCCCTGGACACTATGACTTGCCTCCAGTACGGCATCCTCCATCCCCTCCATCCCGG 

CAAGMCAGCCACATCCCTGGACATTATGACTTGCCT 
3070 3080 3090 3100 3110 3120 3130 

1490 

inputs CGCCAGGACCGC 



CGCCAGGACCGT 
3140 3150 
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1890 1900 1910 1920 1930 1940 1950 
GACCACTCTGATGGCTGTGACCCTGTTCATGGACGCTGTCAGTGCCAGGCTGGCT6GATGGGTGCCCGCT 

6ACC -CAC - GCGTCCGGt^CCCTGftcAtGGACAGtlX 

10 20 30 40 50 60 70 

1960 1970 1980 1990 2000 2010 2020 
GCCACCTGTCCTGCCCTM 

GCCACCTGCCTtGCCCGGA^^ 

80 90 100 110 120 130 140 

2030 2040 2050 2060 2070 2080 2090 
CACCTGTCTCCCTGAGAATGGCAACTGCGTGTGTGCACCCGGATTCCGGGGCCCCT 

TACCtGtGtGTCtGAGAAtGGCMCtGGGfGtGCGCAC 

150 160 170 180 190 200 210 

2100 2110 2120 2130 2140 2150 2160 
TGTCAGCCTGGCCGCTATGGCAAACGCTGTGTGCCCTGCAAGTG ■ - - CGCTAACCACTCCTTCTGCCACC 



TGCCCGCCTGGTCGCTATGGCAAACGC^ 



220 230 240 250 260 270 280 

2170 2180 2190 2200 2210 2220 2230 
CCTC6MCGGGACGTGCTACTGCCTGGCTGGCTGGACAGGCCCCGACTGCTCCCAGCCATGCCCTCCAGG 

CATCGGACGGGACCtGCTCCTC^ 

290 300 310 320 330 340 350 

2240 2250 2260 '2270 2280 2290 2300 
ACACTGGGGAGAAAACTGTGCCCAGACCTGCCAATGTCACCATGGTGGGACCTGCCATCCCCAGGATGGG 



CCACtGGGGACtCAMTGCT^ 

2310 2320 2330 2340 2350 2360 2370 
AGCTGTATCTGCCCCCTAGGCTGGACTGGACACCACTGCTTAGAAGGCTO 

AGCtGtAtctGCACG^ 

430 440 450 460 470 480 490 

2380 2390 2400 2410 2420 2430 2440 
CTAACTGCTCCCAGCCATGCCAGTGTGGTCCTGGAGAAAAGTGCCACCCAGAGACTGGGGCCTGTGTATG 



TCAACtGCtCCCAGCTAtGTCAGt™ 

500 510 520 530 540 550 560 

2450 2460 2470 2480 2490 2500 2510 
TCCCCCAG&GCACAG^ 

tCCCdCAGGACA^ 

2520 2530 2540 2550 2560 2570 2580 
CCAGTAGCGTATAACTCGCTG^^ 

CCCGTC&CCCAtMctCACtGGCT^ 

640 650 660 670 680 690 700 
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2590 2600 2610 2620 2630 2640 2650 
TGGCACTGTTCATTGGCTA^ 

tAGCACTG^^ 

„ 2660 2670 2680 2690 2700 2710 2720 
CAGCGGGCGGCTGGACGGCTCCGAGTATGTCATGCCAGATGTCCCTCCGAGCTACAGTCACTACTACTCC 



790 800 . 810 820 830 840 

2730 2740 2750 2760 2770 2780 

MCCCCAGCTA^ -C 

MCCCCAGCtACCAcACAC^ 

850 860 870 880 890 900 910 

2790 2800 2810 2820 2830 2840 2850 
CGCTCTTTGCCAGCCTGCAGMCCCTGAGCGGCCAGGTGGGGGCCMGGGC^ 

920 930 940 950 960 970 980 

2860 2870 2880 2890 2900 2910 2920 

GCCTGCTGACTGGAAGCACCGCCGGGAGCCCCCTCCAGGGCCTCTGGACAGGGGGAGCAGCCGCCTGGAC 



GCCCGCfGAcfGGMGCACCGCCGGGAGCCCCAt- - - GACAGAGGCGCCAGCCACCf GGAC 

990 1000 1010 1020 1030 

2930 2940 2950 2960 2970 2980 2990 

CGAAGCTACAGCTATAGCTACAGC - - - ■ •AATGGCCCAGGCCCATTCTACGATAAAGGGCTGATCTCTG 



1040 1050 1060 1070 1080 1090 1100 

3000 3010 3020 3030 3040 3050 3060 
AAGAGGAGCTCGGGGCCAGTGTGGCTTCCCTGAGCAGTGAGAACCCATATGCCACCATCCGGGACCTGCC 



^MGAGG&gAGGGM 

3070 3080 3090 3100 3110 3120 3130 

CAGCTTGCCAGGGGGCCCCCGGGAGAGCAGCTACATGGAGATGAAAGGCCCTCCCTC 

CAGXCTGCCtGGGGMCCC^ 
1180 1190 1200 1210 1220 1230 1240 

3140 3150 3160 3170 3180 3190 3200 
AGGCAGCCTCCTCAGTTTTC^ 

AGGCAGtctctrcAtCTCCGGGAcAGGCAG - - - CAGCGGCAACTGCAGCCAcAGAGGGAcA6CG(^Acet 
1250 1260 1270 1280 1290 1300 . 1310 

3210 3220 3230 3240 3250 3260 3270 
ACI^GCAGCCCAGCCCCCTGATCCATGACCGAGACTCTGTGGGCTCCCAGCCCCCT^ 

Aj&GCAGCCCAGCCCCT^ 

1320 1330 1340 1350 1360 1370 1380 
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3280 3290 3300 3310 3320 3330 3340 
ACCCCCCGGCCACTATGACTCACCCAAGAACAGCCACATCCCT6GACATTATGACTTGCCTCCAGTACGG 



GCCTCCTGGTCACTACGACTCCCCCAAGAACAGCCATATCCCTGGACACTATGACTTGCCTCCAGTACGG 
1390 1400 1410 1420 1430 1440 1450 

3350 3360 3370 3380 3390 3400 3410 
CATCCCCCATCACCTCCACTTCGACGCCA6GACCGTTGAGGAGCGAGGATGGTATGGCAGAGGCCAGCAC 

CAtCCTCCAtCCCCtCCATCCCGGCGCCAGGACCGCtGMGAGC - • - GGAGC 

1460 1470 1480 1490 1500 1510 

3420 3430 3440 3450 3460 3470 3480 
ACCTGGCTGTTGCTGCTCAAGGCTGGGGACAGAGCCTAGTGTACCCCTGCCAG6AGCAGGGAGTGGACCG 

- - GtGCCtA^GtACCT-TGCCAGGAGCAGGGACtGGACCA 

1520 1530 1540 1550 

3490 3500 3510 3520 3530 3540 3550 

GCAGGCTGTGAACATGAACAACGCTTAACAGAGCAAGTGATGG - GAGCCTTGTTCCTGGG - TTCTACCAT 

GCAGGCCACGAACAGAAACA- - -CiTGGTGAAGTGAACAGAGACG^ 

1560 1570 1580 1590 1600 1610 1620 

3560 3570 3580 3590 3600 3610 
GGGAGACGCTGATCAGCAGGATGCCTGGCTCCCTTTCCCAACCCACTGCTCCCMGGCCTCCAGGGC - - - 

GGGAGACACiAGTTGACAM^ 

1630 1640 1650 1660 1670 . 1680 

3620 3630 3640 3650 3660 3670 3680 

- • CCTGTGTACATAAACTGGTGGGTTGGAAGTTGCTGGGTAAC - TCTGATTTCAGACATGCGTGTGGGGT 

AGCTGGtGGGCAGAAtGfrGiTGTACMGtGtGATiTTAGA 
1690 1700 1710 1720 1730 1740 1750 

3690 3700 3710 3720 3730 3740 3750 

ACCTTTTCTGTGC - -ATGCTCAGCCTGGGCTCTGTGCGTGTGTGTGTTTCTGTGATTTTAGAAGGGTACC 

ACCliiTCtGtGTGTAtGCTCAGGCAGG- - -CTGTG- - - -TGtGtcfctAGTtGGCliTAGAGGGAGtCA 
1760 1770 1780 1790 1800 1810 1820 

3760 3770 3780 3790 3800 3810 3820 
AG - GCAGGTTCTGTCCTA6GGCACTTACCATTTAGTAGGGAGATGGAACCAACCCAATTAACTCTAGCAA 

GGTATAGGTTCTG - CCTTCTGCACtiTCCATCiTATCTAGTAGtCAG - • CTTCCAAGCTTA • ACtAGTTA 
1830 1840 1850 1860 1870 1880 

3830 3840 3850 3860 3870 3880 3890 
TAGCCTCGTMCTGGCCTCCTCCATTGATTCAGTGMCCTTCCMTGCATGGCTCATMTTTCAAAATAC 

GAGC-fCCA CCAGCAGCA- -GGCCCTAACTACCTGCCT GCCC ttCA C 

1890 1900 1910 1920 1930 

3900 3910 3920 • 3930 3940 3950 3960 
AGGCTGGTTAGTTACTCCCTACCTGAMGCCTTCATAGGTGCCTCTTTGCTCTTCTGCCAGTATCAAAAC 

- - -CCAGtAA- -fCCfcCATGTCt- -ttGC- -tCAGAGGA tTGCfC CC CGACTC 

1940 1950 1960 1970 
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3970 3980 3990 4000 4010 4020 4030 
TTTGMGGCCTTAMGGCCCTGCTTTGCCTGGCCCATCTGTCTCTCCASCCTCACCTTGAACTGTGTTC 



TGGTGTTGTCCT CCTGGTACGCCTTG- - - - ACGGTC - CTGCAGTCTC - CCT TTC 

1980 1990 2000 2010 2020 

4040 4050 4060 4070 4080 4090 4100 

CTGTCACTGCACGCCAGTCACACCGGCCTCTAGGTCCTCCTGTAGGCCACTCTTCTTTCTGGCACAGGGA 

CCGTCT -fGCt- -TCATtCTTtC- - ■ CCAGAATGAAGGC - tGtCTGCCACCCtActtCCCAGCCCAGGAA 
2030 2040 2050 . 2060 2070 2080 

4110 4120 4130 4140 4150 4160 4170 
CCTGCACACCTGGAGTGCCCTTCCTCC^^ 

TTGGCACATCtAAGTt- - - CAGCCiTCCTMGiTACCCOTGAGTCCT^ - CACATATTCC 

2090 2100 2110 2120 2130 2140 2150 

4180 4190 4200 4210 4220 4230 4240 
TCAGGGAAGTGCCCACCCTCCGTACATCTTTCACAGCCCTGATTGCAGCTGTGTTCACTCACCAGGTACC 

AcagaacA- - -cccACc- -cc- -AcAfcf - - -Genre- - - -AtAGctActctcttctc-cAc- • -gtAcc 

2160 2170 2180 2190 2200 

4250 4260 4270 4280 4290 4300 4310 
TGCAGAAGGCCTACAGGGTGCCAGGCACTTCTTTMTGGGTTCTTTCTTTAT 

CAcAl^GGCAGMGTGGtACCAGGCAAGA- ^GAtGGGAttGttGCAttttGt- •itTGitTtfGAGAc 
2210 2220 2230 2240 2250 2260 2270 

4320 4330 4340 4350 4360 4370 4380 

TCTGCCTCCCCCACTAGACTGTMGCTCCCTGAAGGCAAGAATCCTG - - TGCTTATGCTCMTATTAGCT 

tctGTCtCACTAtGTAGtCCTGGC^^ 

2280 2290 2300 2310 2320 2330 2340 

4390 4400 4410 4420 4430 4440 

CTCCCTT- -GGCACAGAGT- • ■AGGCACTCAACAAA-TGCTCCCCAAAAGGCTGAGTGGCTGACTGMTT 

GGGTlTMCGGCtcAGGGtCACATGCAcAGCtCMGCtGCACTa - -tTCC- -CCtGttGC 

2350 2360 2370 2380 2390 2400 

4450 4460 4470 4480 4490 4500 4510 
AAGTACCAGTGACATGCAGTAACTGCTAAGATAGATGAGCCATCTGTATGCTCTGACAGTTACAG-ACTG 

tAGAf • - - -tAGCGt-CtGCCtCCCCCtAG^GGAGAGGCTGATCGCCAGCtct- -CtGAtGCAGGActC 
2410 2420 2430 2440 2450 2460 

4520 4530 4540 4550 4560 4570 4580 

MTMGTTGGAGACT-TCCCTAMGGGTGGCATTTCCCCAGGGTAACMCGCAGAGCTCAGGTGTGGGAA 

tGGf GTtfAGGCf cActCACtAitGGTtT- CCitGGCACAGGGtAGTCACTCAAf A- -AATGtTCCTCtA 
2470 2480 2490 2500 2510 2520 2530 

4590 4600 4610 
GGTGCCAGGGGCAGGGGTGCAGAGGGGGTGAGGC 



AAAGCTGAAAAAAAAAAAAAAAAAGGGCGGCCGC 
2540 2550 2560 
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10 20 30 40 50 60 70 

inputs MSPPLCPLLLWVGLRLAGTLNPSDPNTCSFWESFTTTTKESHSRPFSLLPSEPCERPWEGPHTCPSPQT 



80 90 100 110 120 130 140 

inputs QRKLLASRDSFCMVCVGAGVQWRDRSALQPQTGNALSMRPQPRVLSGAPSLASPGHTVVVKTDHRQRLQC 



150 160 170 180 190 200 210 

inputs CHGFYESRGFCVPLCAQECVHGRCVAPNQCQCVPGWRGDDCSSAPNCLQPCTPGYYGPACQFRCQCHGAP 



220 230 240 250 260 270 280 

inputs CDPQTGACFCPAERTGPSCDVSCSQGTSGFFCPSTHPCQNGGVFQTPQGSCSCPPGWMGTICSLPCPEGF 



290 300 310 320 330 340 350 

inputs HGPNCSQECRCHNGGLCDRFTGQCRCAPGYTGDRCREECPVGRFGQDCAETCDCAPDARCFPANGACLCE 



360 370 380 390 400 410 420 

inputs HGFTGDRCTDRLCPDGFYGLSCQAPCTCDREHSLSCHPMNGECSCLPGWAGLHCNESCPQDTHGPGCQEH 



•STHASG 



430 440 450 460 470 480 490 

inputs CLCLHGGVCQATSGLCQCAPGYTGPHCASLCPPDTYGVNCSARCSCENAIACSPIDGECVCKEGWQRGNC 



500 510 520 530 540 550 560 

inputs SVPCPPGTWGFSCNASCQCAHEAVCSPQTGACTCTPGWHGAHCQLPCPKGQFGEGCASRCDCDHSDGCDP 



-DP 



570 580 590 600 " 610 620 630 

inputs VHGRCQCQAGWMGARCHLSCPEGLWGVNCSNTCTCKNGGTCLPENGNCVCAPGFRGPSCQRSCQPGRYGK 

VHGQCRCQAGWMGTRC^^ 

10 20 30 40 50 60 70 
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640 650 660 670 680 690 

i nputs RCVPCKCAN ■ HSFCHPSNGTCYCLAGWTGPDCSQPCPPGHWGENCAQTCQCHHG6TCHPQDGSCICPLGW 

RCVQCKCNNNHSSCHPSDGTCSCWGWTGPDCSEACPPGHWGLKCSQLCQC^ 

80 90 100 110 • 120 130 140 

700 710 720 730 ' 740 750 760 

inputs TGHHCLEGCPLGTFGANCSQPCQCGPGEKCHPETGACVCPPGHSGAPCRIGIQEPFTVMPTTPVAYNSLG 



TGPNCLEGCPPRMFGVNCSQLCQCDLGEMCHPETGACVCPPGHSGADCKMGSQESFTIMPTSPVTHNSLG 
150 160 170 180 190 200 210 

770 780 790 800 810 820 830 

inputs AVIGIAVLGSLVVALVALFIGYRHWQKGKEHHHLAVAYSSGRLDGSEYVMPDVPPSYSHYYSNPSYHTLS 



AVIGIAVLGTLVVALIALFIGYRQWQKGKEHEHLAVAYSTGRLDGSDYVMPDVSPSYSHYYSNPSYHTLS 
220 230 240 250 260 270 280 

840 850 860 870 880 890 900 

i nputs QCSPNPPPPNKVPGP - LFASLQNPERPGGAQGHDNHTTLPADWKHRREPPPGPLDRGSSRLDRSYSYSYS 



QCSPNPPPPNKVPGSQLFVSSQAPERPSRAHGRENHTTLPADWKHRREPH- ■ • - DRGASHLDRSYSCSYS 
290 300 310 320 330 3 4 0 350 

910 920 930 940 950 960 970 

inputs --NGPGPFYDKGLISEEELGASVASLSSENPYATIRDLPSLPGGPRESSYMEMKGPPSGSAPRQPPQFWD 



HRNGPGPFCHKGPISEEGLGASVMSLSSENPYATIRDLPSLPGEPRESGYVEMKGPPSVSPPRQSLHLRD 
360 370 380 390 400 410 420 

980 990 1000 1010 1020 1030 1040 

inputs SQRRRQPQPQRDSGTYEQPSPLIHDRDSVGSQPPLPPGLPPGHYDSPKNSHIPGHYDLPPVRHPPSPPLR 

RQQR-QLQPQRDSGTYEQPSPLSHNEK^ 

430 440 450 460 470 480 490 

1050 
inputs RQDR 

RQDR 
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Input file T272Atrxa6b6; Output File T272Alrxa6b6.pat 
Sequence length 3567 

GTCCGACCCACGCGTCCGAGCCACACCCTGAAGGTGGTTGGAAGGAGGGAAGGATCTAGGTCCTGAGCACTGGAATTCC 79 

CCAGAACAGCATCTGGCTTCCCAGACCCATGCTGGCCACCACTGATGTGTCCTTCCGGCTGCTGGCTGCAGTGCTGTTC 1 58 

TGTTGT7GGGTGCCCTGTGGCAGGCTTGTGCAATGCCACTCTGTCCCCTCCTCCTCCTGGCCCTAGGCCTGCGTCTGGC 237 

TGGAACACTCAACTCCAATGATCCCAATGTCTGTACCTTCTGGGAAAGCTTCACCACGACCACTAAGGAGTCCCACCTT 316 

CGCCCCTTCAGCCTGCCCCCAGCCGAGTCCTGCGACAGGCCCTGGGAAGACCCCCACACCTGCGCTCAGCCTACGGTTG 395 

TCTACCGGACTGTGTACCGTCAGGTGGTGAAGATGGACTCCCGCCCACGGCTGCAGTGCTGTGGGGGTTACTACGAGAG 474 

CAGTGGAGCCTGTGTCCCACTCTGTGCCCAGGAGTGTGTCCACGGTCGCTGTGTGGCTCCTAATCGGTGCCAGTGTGCA 553 

CCAGGCTGGCGGGGTGACGACTGTTCCAGTGAGTGTGCTCCTGGAATGTGGGGACCACAGTGTGACAGGCTCTGCCTCT 632 

GTGGCAACAGCAGTTCCTGTGATCCCAGGAGTGGGGTGTGTTTTTGCCCCTCTGGCCTGCAGCCCCCCGACTGCCTTCA 711 

GCCT TGCCCCG ATGGCCACT ATGGTCCTGCCTGCCAGT T TG AT TGCCAT TGCT ATGGGGCATCCTG TGACCCCCGGGAT 790 

GGAGCCTGCTTCTGCCCCCCAGGGAGAACAGGACCCAGGGCACTGATGGCTTCTTCTGCCCCAGAACTTATCCTTGCCA 869 

M G V I C S 6 

AAATGGAGGTGTTCCTCAGGGCTCTCAAGGCTCCTGCAGCTGCCCACCGGGCTGG ATG GGT GTC ATC TGT TCG 942 

LP CPE GFHGPNCTQ ECRCHN 26 

CTG CCA TGC CCA GAG GGT TTC CAC GGA CCC AAC TGT ACT CAG GAA TGT CGT TGC CAC AAT 1002 

GGLC0RFTGQCHCAPGY1G0 46 

GGT GGC CTT TGT GAC AGG TTT ACT GGG CAG TGC CAC TGT GCT CCT GGC TAT ATC GGG GAT 1062 

RCREECPVGRFGGOCAETCO 66 

CGG TGC CGT GAA GAG TGC CCT GTG GGC CGC TTC GGT CAA GAC TGT GCT GAG ACC TGT GAC 1122 

CAPGARCFPANGACLCEHGF 86 

TGT GCT CCT GGC GCT CGT TGC TTT CCT GCC AAT GGC GCG TGT CTG TGC GAA CAT GGC TTC 1182 
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TG D R CTE RLCPDGRYG LSCQ 106 

ACA GGC GAC CGC TGC ACT GAG CGA CTC TGT CCA GAT GGC CGC TAT GGT CTG AGC TGC CAA 1242 

DPCTCDPEH S LSCHPMHGEC 126 

GAT CCC TGC ACC TGC GAC CCA GAA CAC AGT CTC AGC TGC CAC CCA ATG CAC GGC GAG TGC 1302 

SC QPGWAGL HCNE SCP QD T H 146 

TCC TGC CAG CCA GGT TGG GCG GGC CTC CAC TGC AAC GAG AGC TGC CCT CAG GAC ACG CAC 1362 

GAGCQEHCLCLHGGVCLAD S 166 

GGA GCC GGT TGC CAG GAG CAC TGC CTC TGT CTG CAC GGC GGT GTT TGC CTC GCC GAC AGC 1422 

GLCRCAP GY T.GPHCANLCPP 186 

GGC CTC TGC CGG TGT GCA CCT GGC TAC ACG GGA CCT CAC TGC GCT AAT CTT TGT CCA CCT 1482 

N TYG1NCSS HCSCENAIA CS 206 

AAC ACT TAT GGG ATC AAC TGT TCC TCC CAC TGC TCC TGT GAA AAT GCC ATT GCC TGC TCT 1542 

P VDGTC1CKEGWQRG NC SVP 226 

CCT GTC GAC GGC ACG TGC ATC TGC AAG GAA GGT TGG CAG CGT GGT AAC TGC TCT GTG CCC 1602 

C PPG T WGF SCN AS C QC A H E G 246 

TGT CCC CCT GGC ACC TGG GGC TTC AGT TGC AAT GCC AGT TGC CAG TGT GCC CAC GAG GGA 1662 

VCSPQTGACTCTPGWRGVHC 266 

GTC TGC AGC CCC CAA ACT GGA GCC TGT ACT TGC ACC CCT GGG TGG CGT GGG GTT CAC TGC 1722 

QLPCPKGQFGE. GCA SVCDC'D 286 

CAA CTT CCG TGC CCG AAG GGA CAG TTT GGT GAA GGT TGT GCC AGT GTC TGT GAC TGT GAC 1782 

HSDGCDPVHGHCRCQA GWMG 306 

CAC TCC GAT GGC TGT GAC CCT GTT CAT GGA CAC TGC CGA TGT CAG GCT GGC TGG ATG GGC 1842 

TRCHLPCPEGF.fG.AM.CSNAC-.326 

ACA CGT TGC CAC CTG CCT TGC CCA GAG GGC TTT TGG GGA GCC AAC TGC AGC AAT GCC TGT 1902 

T CKNGG TCVPE N GNCVCA P G 346 

ACC TGC AAG AAT GGT GGC ACT TGT GTA CCT GAG AAC GGC AAC TGT GTG TGC GCA CCA GGG 1962 
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FR GPSCQRPC PPGRYG K RC V 366 

TTC AGA GGC CCC TCC TGC CAG AGG CCC TGC CCG CCT GGT CGC TAT GGC AAA CGC TGT GTG 2022 

PCK CNMH SSCHPSOGTCSCL 386 

CCC TGC AAG TGC AAC AAC CAT TCT TCC TGC CAC CCG TCG GAT GGG ACC TGC TCC TGC CTG 2082 

AGWT GPDCSESCPPGHWGLK 406 

GCA GGC TGG ACA GGC CCT GAC TGC TCT GAA TCA TGT CCC CCA GGC CAC TGG GGA CTC AAA 2142 

OS Q P C QCHHG ATCHP Q DG SC 426 

TGC TCC CAA CCC TGC CAG TGT CAT CAT GGT GCC ACC TGC CAC CCC CAG GAT GGG AGC TGT 2202 

VCIPGW TGPNCSEGCPSRMF 446 

GTC TGC ATC CCA GGC TGG ACT GGA CCC AAC TGC TCG GAA GGC TGC CCA TCA AGA ATG TTT 2262 

GVNCSQLCQCDPGE.MCHPET 466 

GGT GTC AAC TGC TCC CAG CTA TGT CAG TGT GAT CCT GGA GAG ATG TGC CAC CCA GAG ACT 2322 

GACVCPPGHSGAHCKVG S'QE 486 

GGG GCT TGC GTC TGT CCC CCA GGA CAC AGT GGT GCG CAC TGC AAA GTG GGC AGC CAG GAG 2382 

SF T1MPTSPV1HNSLGAVIG 506 

TCC TTC ACC ATA ATG CCC ACC TCT CCT GTG ATC CAT AAC TCA CTG GGT GCC GTG ATT GGC 2442 

1AVLGTLVVALVALF 1GYRH 526 

ATT GCA GTG CTG GGG ACC CTT GTG GTG GCC CTG GTA GCA CTG TTT ATT GGC TAC CGA CAC 2502 

WQKGKEHEHLAVAYSTGRLD 546 

TGG CAA AAG GGC AAG GAA CAT GAG CAC TTG GCA GTG GCT TAC AGC ACT GGG CGA CTG GAT 2562 

GS DYVMPDVSPS YSHYYSNP 566 

GGC TCC GAT TAC GTC ATG CCA GAT GTC TCT CCG AGC TAC AGT CAC TAC TAT TCC AAC CCT 2622 

SYHTLSQCSPN PPPP NKIPG 586 

AGC TAC CAC ACA CTG TCT CAG TGT TCT CCT AAC CCT CCA CCC CCT AAC AAG ATT CCA GGC 2682 

SQLFVSSQA SE RPNRNHG RD 606 

AGT CAG CTG TTT GTC AGC TCC CAG GCA TCT GAG CGG CCA AAC AGA AAC CAT GGG CGA GAT 2742 
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NHA TL PADWK HRR E SH ORA F 626 

AAC CAG GCC ACA CTG CCC GOT GAC TGG AAG CAC CGA CGG GAG TCC CAT GAC AGA GCT TTC 2802 

L RH QPPG PKV* 637 

CTC AGG CAC CAG CCA CCT GGA CCG AAG GTA TAG 2835 

CTGTAGCTATGGCCACAGGAATGGCCCGGGGCCATTCTGTCATAAAGGTCCCATCTCTGAAGAAGGACTAGGGGCAAGC 2914 

GTTATGTCCCTGAGCAGTGAGAACCCCTATGCGACCATCCGAGACCTGCCCGGCCTGCCTGGGGAACCCCGAGAAAGCA 2993 

GCTATGTGGAGATGAAAGGCCCTCCATCAGTGTCTCCCCCCAGGCAGCCTCTTCATCTCCGGGACAGGCAGCAGCAGCA 3072 

ACTGCAGTCTCAGAGAGACAGCGGCACCTATGAGCAGCCCACTCCCTTGAGCCGTAATGAAGAGTCTGTGGGCTCCATG 31 51 

CCCCCTCTTCCTCCGGGCCTGCCACCCGGCCACTATGACTCGCCCAAAAACAGCCACATCCCTGGACACTATGACTTGC 3230 

CTCCAGTACGGCATCCTCCATCACCTCCATCCCGGCGCCAGGACCGCTGAGGAGCCAGCATGGTATGGGAGAGTGCCTG 3309 

TGAACCCTGCCAGGAGCAGGGCCTGGACCAGCAGGCCATGAATAGACATACTTGGTGAAGTGAACGGAGACTGAGGATG 3388 

GCTCTGCTTCCACCGAGGGAGACACTAGTTGGCAMGTGTCTAACCTCCCTTTTCCAGCCCATTGCTCAAGTCCCCCAG 3467 

GCTGTGGACATGAGCTGGTGGGCAGAATGTTGTTGTTGAAGTCTGATTTTAGATTGATTTTTTAAAAAAAAAAAAAAAA 3546 

AAAAAAAAAAAGGGCGGCCGC 3567 
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10 20 30 40 50 60 

human GTC-GACCCACGCGTCCGCTCGAA6CGGGGACCCTCGCCCCGTCCTCGGCTGTCCAGTCCTCCTCCTC6C 

rat GTCCGACCCACGCGTCCG AGC- - - - - - ■ CACACCCTGAAGGTGGTTGGAAGG 

10 20 30 40 

70 80 90 100 110 120 130 

human AGACCCCGGCGGTTCCTACCCCAGGCCGCAGGGGAGACGGTGCCCCAAGGCAGGCTTCATA - - TCCTGAA 

rat ' AGG- - - -GAAGGATCTAGGTCCTGAGCACTGG- - - - - - AATTCCCCAGAACAG - CATCTGGCTTCCCAGA 

50 60 70 80 90 100 

140 150 160 170 180 '190 200 

human CGCTGG-GATCCCCCA-GGACATTCCCTGGCCCCCAGGCCCCAGGTCCCAGGCCCCAGGGCTGAGCTGTG 

rat CCCATGCTGGCCACCACTGATGf GTCCTT - - - - CCGG CTG CTGGCTGCAGTGCTGTTCTGTT 

110 120 130 140 150 160 

210 220 230 240 250 260 270 

human GGCAGGCCCCACCTGGCCTCTGCAATGTCACCGCCTCTGTGTCCCCTCCTTCTCCTGGCTGTGGGCCTGC 

rat GTTGGGTGCCCTGTGGCA- -GGCT^ 

170 180 190 200 210 220 230 

280 290 300 310 320 330 340 

human GGCTGGCTGGMCTCTCAACCCCAGTGATCCCMTACCTGCAGCTTCTGGGAAAGCTTCACTACCACCAC 

rat GTCTGGCTGGAACACTCAA^^ 

240 250 260 270 280 290 300 

350 360 370 380 390 400 410 

human CAAGGAGTCCCACTCCCGCCCCTTCAGCCTGCTCCCCTCAGAGCCCTGCGAGCGGCCCTGGGAGGGCCCC 

rat TAAGGAGfcGCACCTTCGCCCCTTCAGCCTGCCCC 

310 320 330 340 350 360 370 

420 430 440 450 460 470 

human CATACTTGC - CCCAGCCCACAAA- - -CT- -CAGA- - - GGAAACTCCTGGCT - TCTAGGGATTCATTCTGC 

rat CACACCTGCGCTCAGCCTACGGTTGTCTACCGGACTGTGTACCGTC 

380 390 400 410 420 430 440 

480 490 500 510 520 530 540 

human ATGGTCTGTGTCGGGGCTG - GAGTGC AGTGGCGAGATC - GTAGTGCACTGC AACCTCAAACAGGGAATGC 

rat CACGCCTG- - -CAGTGCTGTGGGGGTTACTACGAGAGCAGTGGAGC-C - - -TGC 

450 460 470 480 490 500 

550 560 570 ' 580 590 600 610 

human GCTTTCTATGCGCCCTCAGCCCAGAGTGTTGAGTGGTGCCCCTTCCCTG-GCCTCCCCTGGCCACACTGT 

rat CCAGG - AGTGTGTCCACGGTC GCTGTGTG - • GCTCCTAATCGGTGCCAGTGTGCACCAGGCTGG 

510 520 530 540 550 560 
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620 630 640 650 660 670 680 

human GGTGGTGAAGAC6GACCACCGCCAGCGCCTGCAGT6CTGCCATGGCTTCTAT6AGAGCAG6GGGTTCTGT 

rat CGGGGTGACGACTGT TCCAGTG- •AG-TGTGCt-CC-TGGAA- -TGTGGGGACCACAG- - - -TGT 

570 580 590 600 610 

690 700 710 720 730 • 740 750 

human GTCCCGCTCTGTGCCCAGGAGTGTGTCCATGGCCGTTGTGTGGCACCCA - - ATCAGTGCCAATGTGTGCC 

rat GACAGGCT CT G- - - CCTC - - -f GT GGCAACAGCAGTTCCTGTGAtcCCAGGAGTGGGGTGtG 

620 630 640 650 660 670 680 

760 770 780 790 800 810 

human AGGCTGGCGGGGCGACGACTGTTCCAGTGCCCCGAACTGCCTTCAGCCCTGTACCCC - - TGGCTACTATG 

rat CCTCTGGC CTGCAG--CC-- - -CCCCGA-CtGCCttcAGCCTtG- - CCCCGATGGCCACTATG 

690 700 710 720 730 

820 830 840 850 860 870 880 

human GCCCTGCCTGCCAGTTCCGCTGCCAGTGCCATGGGGCACCCTGCGATCCCCAGACTGGAGCCTGCTTCTG 

rat GTCCTGCCiGCCAGTTTGATiGCCA 

740 750 760 770 780 790 800 

J 

890 900 910 920 930 940 950 

human CCCCGCAGAGAGAACTGGGCCCAGCTGTGACGTGTCCTGTTCCCA6GGCACTTCTGGCTTCTTCTGCCCC 



rat CCCCCCAGGGAGAACAGGACCCAG GGCACf GAT GGCftctf CT GCCCC 

810 820 830 840 850 

960 970 980 990 1000 1010 1020 

human AGCACCCATCCTTGCCAAAATGGAGGTGTCTTCCAAACCCCACAGGGCTCCTGCAGCTGCCCCCCTGGCT 

rat AGAACTTATCCTTGCCAAAATGGA^ 

860 870 880 • 890 900 910 920 

1030 1040 1050 1060 1070 1080 1090 
human GGATGGGCACCATCTGCTCCCTGCCCTGCCCAGAGGGCTTTCACGGACCCAACTGCTCCCAGGAATGTCG 



rat GGATGGGTGTCATCTGTTCCCTGCCATGCCCAGAGGGTTTCCACGGACCCAACTGTACTCAGGAATGTCG 
930 940 950 960 970 980 990 

1100 1110 1120 1130 1140 1150 1160 
human CTGCCACAACGGCGGCCTCTGTGACCGATTCACTGGGCAGTGCCGCTGCGCTCCGGGTTACACTGGGGAT 

■rat TTGCCACMTGGTGGCCTTT^ 

1000 1010 1020 1030 1040 1050 1060 

1170 1180 1190 1200 1210 1220 1230 
human CGGTGCCGGGAGGAGTGCCCGGTGGGCCGCTTTGGGCAGGACTGTGCTGAGACGTGCGACTGCGCCCCGG 

rat CGGTGCCGTGMGAGTGCCCTCT^ 

1070 1080 1090 1100 1110 1120 1130 

1240 1250 1260 1270 1280 1290 1300 
human ACGCCCGTTGCTTCCCGGCCAACdGCGCATGTCTGTGCGAACACGGCTTCACTGGGGACCGCTGCACGGA 



rat GCGCTCGTTGCTTTCCTGCCAATGGCGCGTGTCTGTGCGAACATGGCTTCACAGGCGACCGCTGCACTGA 
1140 1150 1160 1170 1180 1190 1200 
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1310 1320 1330 1340 1350 1360 1370 
human TCGCCTCTGCCCCGACGGCTTCTAC6GTCTCAGCTGCCAG6CCCCCT6CACCTGCGACCGGGAGCACAGC 



rat GCGACTCTGTCCAGATGGCCGCTATGGTCTGAGCTGCCAAGATCCCTGCACCTGCGACCCAGAACACAGT 
1210 1220 1230 1240 1250 1260 1270 

1380 1390 1400 . 1410 1420 1430 1440 
human CTCAGCTGCCACCCGATGAACGGGGAGTGCTCCTGCCTGCCGGGCTGGGCGGGCCTCCACTGCAACGAGA 



rat CTCAGCTGCCACCCAATGCACGGCGAGTGCTCCTGCCAGCCAGGTTGGGCGGGCCTCCAGTGCAACGAGA 
1280 1290 1300 1310 1320 1330 1340 

1450 1460 1470 1480 1490 1500 1510 
human GCTGCCCGCAGGACACGCATGGGCCAGGGTGCCAGGAGCACTGTCTCTGCCTGCACGGTGGCGTCTGCCA 



rat GCTGCCCTCAGGACACGCACGGAGCCGGTTGCCAGGAGCACTGCCTCTGTCTGCACGGCGGTGTTTGCCT 
1350 1360 1370 1380 1390 1400 1410 

1520 1530 1540 1550 1560 1570 1580 
human GGCTACCAGCGGCCTCTGTCAGTGCGCGCCGGGTTACACGGGCCCTCACTGTGCTAGTCTTTGTCCTCCT 

rat CGCCG&CAGCGGCCTCTGCCG^ 

1420 1430 1440 1450 1460 1470 1480 

1590 1600 1610 1620 1630 1640 1650 
human GACACCTACGGTGTCAACTGTTCTGCACGCTGCTCATGTGAAAATGCCATCGCCTGGTCACCCATCGACG 

rat AACACTTATGGGATCMC^ 

1490 1500 1510 1520 1530 1540 1550 

1660 1670 1680 1690 1700 1710 1720 
human GCGAGTGCGTCTGCAAGGAAGGTTGGCAGCGTGGTAACTGCTCTGTGCCCTGCCCACCCGGAACCTGGGG 

rat GCACGTGCATCTGCAAGGAAGG^^ 

1560 1570 1580 1590 1500 1610 1620 

1730 1740 1750 1760 1770 1780 1790 
human CTTCAGTTGCAATGCCAGCTGCCAGTGTGCGCATGAGGCAGTCTGCAGCCCCCAAACTGGAGCCTGTACC 



rat CTTCAGTTGCAATGCCAGTTGCCAGTGTGCCCACGAGGGAGTCTGCAGCCCCCAAACTGGAGCCTGTACT 
1630 1640 1650 1660 1670 1680 1690 

1800 1810 1820 1830 1840 1850 1860 
human TGCACCCCTGGGTGGCATGGGGCCCACTGCCAGCTGCCCTGTCCGAAGGGGCAGTTTGGAGAAGGTTGTG 



rat TGCACCCCTGGGTGGCGTGGGGTTCACTGCCAACTTCCGTGCCCGAAGGGACACTTTGGTGAAGGTTGTG 
1700 1710 1720 1730 1740 1750 1760 

1870 1880 1890 1900 1910 1920 1930 
human CCAGTCGCTGTGACTGTGACCACTCTGATGGCTGTGACGCTGTTCATGGACGCTGTCAGTGCCAGGCTGG 



rat CCAGTGTCiGi&ciGi^ 

1770 1780 1790 1800 1810 1820 1830 

1940 1950 1960 1970 1980 1990 2000 
human CTGGATGGGTGCCCGCTGCCACCTGTCCTGCCCTGAGGGCTTATGGGGAGTCAACTGTAGCAACACCTGC 



rat CTGGATGGGCACACGTTGCCACCTGCCTTGCCCAGAGGGCTTTTGGGGAGCCAACTGCAGCAATGCCTGT 
' 1840 • 1850 1860 1870 1880 1890 1900 
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2010 2020 2030 2040 2050 2060 2070 
human ACCTGCAAGAATGGGG6CACCTGTCTCCCTGAGMTGGCAACTGCGTGTGTGCACCCGGATTCC6GGGCC 

rat ACCTGCMGAATGGTGGCAC^ 

1910 1920 1930 1940 1950 1960 1970 

2080 2090 2100 2110 2120 2130 2140 
human CCTCCTGCCAGAGATCCTGTCAGCCTGGCCGCTATGGCAAACGCTGTGTGCCCTGCAAGTGCGCTAACCA 

rat CCfCCtGCCAGAGGCCCtGCCCGCCTCCTCGGW 

1980 1990 2000 2010 2020 2030 2040 

2150 2160 2170 2180 2190 2200 2210 
human CTCCTTCTGCCACCCCTCGMGGGGACCTGCTACTGCCTGGCTGGCTGGACAGGCCCCGACTGCTCCCAG 

rat TTCTTCCTGCCACCCGTCG^ 

2050 2060 2070 2080 2090 2100 2110 

2220 2230 2240 2250 2260 2270 2280 
human CCATGCCCTCCAGGACACTGGGGAGAAAACTGTGCCCAGACCTGCCAATGTCACCATGGTGGGACCTGCC 



rat CATGTCCCCCAGGCCACTGGGGACTCAMTGCTCCCMCCCTGCCAGTCTcAtCATGGtGCCACCtGCC ' 
2120 2130 2140 2150 2160 2170 2180 

2290 2300 2310 2320 2330 2340 2350 
human ATCCCCAGGATGGGAGCTGTATCTGCCCCCTAGGCTGGACTGGACACCACTGCTTAGAAGGCTGCCCTCT 

rat ACCCCCAGGATGGGA(K^ 

2190 2200 2210 2220 2230 2240 2250 

2360 2370 2380 2390 2400 2410 2420 
human GGGGACATTTGGTGCTAACTGCTCCCAGCCATGCCAGTGTGGTCCTGGAGAAAAGTGCCACCCAGAGACT 

rat AAGAATGTTTGGTGTCM^ 

2260 2270 2280 2290 2300 2310 2320 

2430 2440 2450 2460 2470 2480 2490 
human GGGGCCTGTGTATGTCCCCCAGGGCACAGTGGTGCACCTTGCAGGATTGGAATCCAGGAGCCCTTTACTG 

rat GGGGCTTGCGTCt^ 

2330 2340 2350 2360 2370 2380 2390 

2500 2510 2520 2530 2540 2550 2560 
human TGATGCCGACCACTCCAGTAGCGTATAACTCGCTGGGTGCAGTGATTGGCATTGCAGTGCTGGGGTCCCT 

rat TAATGCCCACCTCTCCTGT^^ 

2400 2410 2420 2430 2440 2450 2460 

2570 2580 2590 2600 2610 2620 2630 
human TGTGGTAGCCCTGGTGGCACTGTTCATTGGCTATCGGCACTGGCAAAAAGGCAAGGAGCACCACCACCTG 

rat TGTGGTGGCCCTGGTAGCAC 

2470 2480 2490 2500 2510 2520 2530 

2640 2650 2660 2670 2680 2690 2700 
human GCTGTGGCTTACAGCAGCGGGCGCCTGGACGGCTCCGAGTATGTCATGCCAGATGTCCCTCCGAGCTACA 



rat GCAGTGGCTTACAGCACTGGGCGACTGGATGGCTCCGATTACGTCATGCCAGATGTCTCTCCGAGCTACA 
2540 2550 2560 2570 2580 2590 . 2600 
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2710 2720 2730 2740 2750 2760 2770 
human GTeACTACTACTCCMCCCCAGCTACCACACCCTGTCGCAGTGCTCCCCAMCCCCCCACCCCCTAACAA 

rat GTCACTACTATTCCAACOT^ 

2610 2620 2630 2640 2650 . 2660 ' 2670 

2780 2790 2800 2810 2820 2830 2840 
human GGTTCCAGGC- ■ -CCGCTCTTTGCCAGCCTGCAGAACCCTGAGCGGCCAGGTGGGGCCCMGGGCATGAT 

rat GAftcCAGGCAGTCAGCtGtiTGTCAGCTCCCAGGCATGTC^ 

2680 2690 2700 2710 2720 2730 2740 

2850 2860 2870 2880 2890 2900 2910 
human AACCACACCACCCTGCCTGCTGACTGGAAGCACCGCCGGGAGCCCCCT-CCAGGGCCrCTGGACAGGGGG 

rat AACCACGCCACACTGCCCGCTO • - -TTTCCT CAGGC 

2750 2760 2770 ' 2780 2790 2800 

2920 2930 2940 2950 2960 2970 
human AGCAGCCGCCTGGACCGAAG CTACAGCTATAGCTACAGCAATGGCCCAGGCCCATTCTACGATA 

rat ACCAGCCACCTGGACCG^ 

2810 2820 2830 2840 2850 2860 2870 

2980 2990 3000 3010 3020 3030 3040 
human MGGGCTCATCTCTGAAGAGGAGCTCGGGGCCAGTGTGGCTTCCCTGAGCAGTGAGAACCCATATGCCAC 

rat AAGGTCCCATGtGTG^ 

2880 2890 2900 2910 2920 2930 2940 

3050 3060 3070 3080 3090 3100 3110 
human CATCCGGGACCTGCCCAGCTTGCCAGGGGGCCCCCGGGAGAGCAGCTACATGGAGATGAAAGGCCCTCCC 



rat CATCCGAGACCTGCCCGGCCTGCCTGGGGAACCCCGAGAAAGCAGCTATGTGGAGATGAAAGGCCCTCCA 
2950 2960 2970 2980 2990 3000 3010 

3120 3130 3140 3150 3160 3170 3180 
human TCAGGATCTGCCCCCAGGCAGCCTCCTCAGTTTTGGGACAGCCAGAGGCGGCGGCAACCCCAGCCACAGA 

rat TCAGTGtctcCCCCCAGGCAGCCfCTtCATCtcCGGGA^ - • CAGCAGCAACTGCAGTCf CAGA 

3020 3030 3040 3050 3060 3070 3080 

3190 3200 3210 3220 3230 3240 3250 
human GAGACAGTGGCACCTACGAGCAGCCCAGCCCCCTGATCCATGACCGAGACTCTGTGGGCTCCCAGCCCCC 



rat GAGACAGCGGCACCTATGAGCAGCCCACTCCCTTGAGCCGTAATGAAGAGTCTGTGGGCTCCAT6CCCCC 
3090 3100 3110 3120 3130 3140 3150 

3260 3270 3280 3290 3300 3310 3320 
human TCTGCCTCCGGGCCTACCCCCCGGCCACTATGACTCACCCAAGAACAGCCACATCCCTGGACATTATGAC 



rat TCTTCCTCCGGGCCTGCCACCCGGCCACTATGACTCGCCCAAAAACAGCCACATCCCTGGACACTATGAC 
3160 3170 3180 3190 3200 3210 3220 

3330 3340 3350 3360 3370 3380 3390 
human TTGCCTCCAGTACGGCATCCCCCATCACCTCCACTTCGACGCCAGGACCGTTGAGGAGCCAGGATGGTAT 



rat TTGCCTCCAGTACGGCATGGTGGATCACCTCCATCCCGGCGCGAGGACCGCTGAGGAGCCAGCATGGTAT 
3230 3240 3250 3260 3270 3280 3290 
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3400 3410 3420 3430 3440 3450 3460 
human GGCAGAGGCCAGCACACCTGGCTGTT6CTGCTCAAGGCTGG6GACAGAGCCTAGT6TACCCCT6CCA66A 

rat GG- -GAG ■ AGTGCCt - GTGAACCC - IgCCAggA 

3300 3310 3320 

3470 3480 3490 3500 3510 3520 3530 
human GCAGGGAGTGGACCGGC^GGCTG^ 

rat GCA6GGCCt6GACCAGCAGGC ----- -cAtGAA tAGACATA 

3330 3340 3350 



3540 3550 3560 3570 3580 3590 3600 
human TGGGTTCTACCATGGGAGACGCTGATCAGCAGGATGCCTGGCTCCCTTTCCCAACCCACTGCTCCCAAGG 

rat CftGG-------tGAA 

3360 

human^CCTCCAGGGCCCTGTGTACA^ 



rat GTCMCCGAgAcTG- -AGGAtgG- 

3370 3380 



3680 3690 3700 3710 3720 3730 3740 
human GTGGGGTACCTTTTCTGTGCATGCTCAGCCTGGGCTCTGTGCGTGTGTGTGTTTCTGTGATTTTAGM 

rat CTCTGC--- 

3390 

3750 3760 3770 3780 3790 3800 3810 
human GTACCAGGCAGGTTCTGTCCTAGGGCACTTACCATTTAGTAGGGAGATGGAACCAACCCMTTAACTCTA 

rat -ftccA ccgAggg AgAcActA 

3400 3410 

3820 3830 3840 3850 3860 3870 3880 
human GCMTAGCCTCCTMCTGGCCTCCTCCATTGATTCAGTGAACCTTCCAATGCATGGCTCATAATTTCAAA 



rat G • un c 



3890 3900 3910 3920 3930 3940 3950 
human ATACAGGCTGGTTAGTTACTCCCTACCTGAAAGCCTTCATAGGTGCCTCTTTGCTCTTCTGCCAGTATCA 

rat AAAg - - 



human 39 AMCTmGAAGGCCn^ 



rat - TGTCT 

3430 

4030 4040 4050 4060 4070 4080 4090 
human GTTCCTGTCACTGCACGCCAGTCACACCGGCCTCTAGGTCCTCCTGTAGGCCACTCTTCTTTCTGGCACA 



rat -AACCtCC- 
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4100 4110 4120 4130 4140 4150 4160 
human 6GGACCTGCACACCTG(^GTGCCCTTCCTCCCCCACTCGCCTGnCACCCCTGCTmCCTTTACACCTC 



rat --- CTTTTCC 

3440 

4170 4180 4190 4200 4210 4220 4230 
human CTCCTCAGGGMGTGCCCACCCTCCGTACATCTTTCACAGCCCTGATTGCAGCTGTGTTCACTCACCAGG 

rat - AGCCC--AttGtt CAAG 

3450 

4240 _ 4250 4260 4270 4280 4290 4300 
human TACCTGCAGMGGCCTACAGGGTGCCAGGCACTTCTnMTGGGTTCTTTCTTTATGTGATTATTTGATT 

rat t - 

3460 

, 4310 4320 4330 4340 4350 4360 4370 

human MTCTCTGCCTCCCCCACTAGACTGTAAGCTCCCTGAAGGCAAGMTCCTGTGCTTATGCTCAATATTAG 

rat CCCCCA 

4380 4390 4400 4410 4420 4430 4440 
human CTCTCCCTTGGCACAGAGTAGGCACTCMCAAATGCTCCCCAAMGGCTGAGT6GCTGACTGAATTAAGT 

rat GGCTGTG-- --- 

3470 

4450 4460 4470 4480 4490 4500 4510 
human ACCAGTGACATGCAGTAACTGCTAAGATAGATGAGCCATCTGTATGCTCTGACAGTTACAGACTGAATAA 

rat GACATG- - ---- ----- 

4520 4530 4540 4550 4560 4570 4580 
human GTTGGAGACTTCCCTAAAGGGTGGCATTTCCCCAGGGTAACMCGCAGAGCTCAGGTGTGGGAAGGTGCC 



rat 



4590 4600 4610 4620 4630 4640 4650 

• " — xr • 



human AGGGGCAGGGGTGCAGAGGGGCTGAGGCTGAGGGGGGTGCAGAGGCTGGAGAAAGGATAACAGGAGAGAG 

rat AGCTGGTGG 

3480 

4660 4670 4680 4690 4700 4 710 4720 
human TATACAGGCATGCCTTGATnATTGCACTTCACAGGTAGCAGMTTTTTAMGAMTTGMGGTTTTGGG 



rat GCAGAATGTT GTTGTTGAAG 

3490 3500 

4730 4740 4750 4760 4770 4780 4790 
human ACATATATGTGACAGCMTAGGTTMGAAMGCAMGCAGAGAMTTGAAGATTTGTGTCAACACTGCTT 

rat - - 
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4800 4810 48 20 4830 4840 4850 4860 
human TMGCAMTCTGTTGGCACCATTTTTCCMTAGCATGTGCCCATTTT6GGTCTCTACATT6CATTTTGGT 



rat --TCTG ATTTTAGAT-- 

3510 3520 

4870 4880 4890 4900 4910 4920 4930 
human MnGCTTGCMTATTTCMGCATTTTCATTGTTATTATATGTGTTATAGTGATCTGTGATCAGTGATCT 

rat - 

4940 4950 4960 4970 4980 4990 5000 
human TTGATATATTATTGTAAnGTTTCGGGGCGCCATGMCCGCACCCATATMCACGGTAMCTTAATCAGC 

rat -tGAtiiliTAAAAAAAA - 

3530 

5010 5020 5030 
human AAAAAAAAAAAAAAAAAAAGGGCGGCCG- 

rat mmm^^ 

3540 3550 3560 
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10 20 30 40 50 

inputs GTC -GACCCACGCGTCCGG- - - TGACCCTGTTCATGGACAGT GCCGATGTCAGG- - -CTGGT- - - 

GTCCGACCCACGCGTCCGAGCCACA^^ 

10 20 30 40 50 60 70 

60 70 80 90 100 110 

i nputs TGGATGGGCACA - CGCTGCCAC - - - CTGCCTTG ■ CCCGGA- - GG- - GCTTTTGGGGAG - CCAAC - TGCAG 

TGGAATTCCC^ 

80 90 100 110 120 130 140 

120 130 140 150 160 170 

i nputs ■ TAAC ACCTGTACC - TGCAAGAATGGTGGTACCTGTG ■ -TGTCT-GAGAATGGCAACTGCCTGTGCGCAC 

CTGGCTGCAGTGC^ - C - TGTCCCCTC 

150 160 170 180 190 200 

180 190 200 210 220 230 

inputs CAG- - - -GGTTCCGAGGCCC-CTCCTGCCAGAGGCCCTGCCCGCC- - TGGTCGCTATGGCAA - AC - -GCT 

CTCCTCCTGGCCCTAGGCCTGCGTCTGGCTGGAACACTCMCTCC 
210 220 230 240 250 260 270 

240 250 260 270 280 
inputs GTGT- -GCAATGC AAGTGT- • - AACAACAACCATTCTTCCTGCCACCCATCG 

GGGAMGCTTCACCACGACCACTAAGGAGTCCckCTTCGC 

280 290 300 310 320 330 340 

290 300 310 320 330 

inputs -GACGGGACCTG- CTCCT-GCCTG- - - GCGGGCTG - GACAGGC - -CCTGACTGC- -TCCG- -AG 

CGACAGGCCCTGGGMGACCCC^ 
350 360 370 380 390 400 410 

340 350 360 370 
i nputs GC- - ATG- • -TCCC- • CCAGGCCA CTGGGG ACT- CAAATGCT CC - • - - 

GTGGTGMGAfGGACTCCCGCCCACGCCTGCAGTGCTGTGGGGGTTACTAC 
420 430 440 450 460 470 480 

380 390 400 410 
inputs - -CAACTCTG- - -CCAG- - - -TGTCATCA TG-GTGGGACCT GCCA. CCCC- - - 

TCCCACTCTGTGCCCAGGAGTG^ 

490 500 510 520 530 540 550 
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420 430 440 450 460 
i nputs CAGGATGGGAS- - - -CTGTATC TGCACGCCAGGCTGGACTGGACC - CM - - CTGC 



CTGGCGGGGTGACGACTGTTCCAGTGAGTGTGCTCCTGGAATGTGGGGACCACAGTGTGACAGGCTCTGC 
560 570 580 590 600 610 620 

470 480 490 500 
i nputs TTGGAAGGCTGC CCA CCAAGAATGTTTGGTGT CAACTGCTCC 

CTCTGTGGCMCAGCAGTOCt™ 

630 640 650 660 670 680 690 

510 520 530 
Inputs C AGCTATGTC- -AGTG TGATCT CGGAGAGATG TGC 



CCGACTGCCTTCAGCCTTGCCCCGATGGCCACTATGGTCCTGCCTGCCAGTTTGATTGCCATTGCTATGG 
700 710 720 730 740 750 760 

540 550 560 570 580 
inputs - -CACCCAGAGAC TGGGGCTTGTCTCTGTCCCCCAGG ACACAG TGGTG 



GGCATCCTGTGACCCCCGGGATGGAGCCTGCTTCTGCCCCCCAGGGAGAACAGGACCCAGGGCACTGATG 
770 780 790 800 810 820 830 

590 • 600 610 620 

inputs CAGAC TGCAAAATGGGAAG - - - -CC- -AGGAGTC-CTT- -CACCATAA- 

GCTTCTtCTGCCCCAGMCTTATCCTTG^^ 
840 850 860 870 880 890 900 

630 640 650 
inputs -TGCCCACC TCT- - -CCCG- • -TGACCCATAA CTC ACTGG 

CTGCCCACCGGGCTGGATGGGTGTCA^^ 
910 920 930 940 950 960 970 

660 670 680 690 700 710 

inputs GTGCAGTGATTGGCATTGCAGTACTGGGAACCCTCGTG- - - -GTGGCCCTGATAG- - -CACTGTTCAT-T 

ACTCAG-GAAiGTCGiTGCC^^ 

980 990 1000 1010 1020 1030 1040 

720 730 740 
1 nputs GGCTA CCG --------- ■ CCAGTGG - - CAAAA- - GGGCAAGGAACA 

GGCtATATCGGGGATCGGTGCCCT^^ 

1050 1060 1070 1080 1090 1100 1110 

750 760 770 780 790 

inputs TGAGCACTTGGCA- - -GTGGCTTAC AGCACTGGGCGG- - CTGG- ATGGCTCTGATTA 



GTGACTGTGCTCCTGGCGCTCGTTGCTTTCCTGCCAATGGCGCGTGTCTGTGCGAACATGGCTTCACAGG 
1120 1130 1140 1150 1160 1170 1180 
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800 810 820 830 840 850 
inputs CGTCA- - TGC - CA6AT • 6TCTCT - -CCGA GCTATAGTCACTACTACT CCAACCCCAGC 

CGACCGCiGCACTGAGCGACTCT 

1190 1200 1210 1220 1230 1240 1250 

860 870 880 890 900 

i nputs TACC - - ACACACTGTCTCAGTGTTCTCCTAACCGCCCGC CCCCTAACA - - -AGGTCC - - CAGGCA 

GACCCAGAACACAGtCTCAGCTGCCACC 

1260 1270 1280 1290 1300 1310 1320 

910 920 930 940 950 
inputs G- -TCAGCT-CTTTGTCAGCTCTCAGGCC-C- - - CTGAGC GGCCA- - AGCAGAGCC - - -CA 

TCCACTGCAACGAGAGCTGCCCtCAGGACAC^ 
1330 1340 1350 1360 1370 1380 1390 

960 970 980 990 1000 1010 
i nputs CGGGCGTCAGMCCATACCACACTGC - - CCGCTGACTGGAAGCACC - - GC CGGGAGCCC C 

CGGCGGf GTTtGCCf CGCCG - ACAGCGGCCTCtGCCGGTGTGCACCTGGCTACACGGGACCTC^ 
1400 1410 1420 1430 1440 1450 1460 

1020 1030 1040 1050 1060 

inputs ATGACAGAGGC - GCCAGCCAC CTGGACCGM- GCTATAGCTGTA- - - -GCTATAGCC 

TAATCTTtGTCCACCTAACA 

1470 1480 1490 1500 1510 1520 1530 

1070 1080 1090 1100 1110 

inputs A- - - - - - CAGG - AATGGCCCAGG - -AC- -CATT- - - - • CTGTCATAAAGGTCCCATCTCTGAA- - - -GA- 

iGCTCTCCtGTCGACGGCACGTG^ 

1540 1550 1560 1570 1580 1590 1600 

. 1120 1130 1140 1150 1160 
inputs GGGACTAGGGGCAAGCGTTA-TGTCCCTGA-GCAGTGAGAACCC-CTA TGCTACC - - - 

CCCCTGGCACCtGGGGCn^ 

1610 1620 1630 1640 1650 1660 1670 

1170 1180 1190 1200 1210 
inputs -ATCCGAGACCTG- CCCAGCCTGCC - TGGGGAAC - - - -CC CGAG- - -AAAGTGGCT 

AAACTGGAGCCTGTACTO 
1680 1690 1700 1710 1720 1730 1740 

1220 1230 1240 1250 1260 
inputs ATGTGGAGATGAAAGGACC- - - -TCCAT- - CAGTGTCCCCTCCCA - GGCAGT - - - -CTCTTCAT C 

GtnGGtGMGGTTCTGCCAGTGtCT^ 

1750 1760 1770 1780 1790 1800 1810 
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1270 1280 1290 1300 1310 

inputs T - CCGG - GACAGGCAG • CAG CGG- - -CAACTGC- -AGCCACAGAGGG- -ACAGCGGCACC 

TGCCGATGTCAGGCT GGCTGGATGGGCACACGTTGCCACCf GCC7TGCC ■ CAGAGGGCTTITGGGGAGCC 
1820 1830 1840 1850 1860 1870 1880 

1320 1330 1340 1350 

inputs TA-TG-AGCA- -GCC CAGC CCCTTGAG- - CCATAATGAAGAGTCTTTGGG - - - - 

( J * ' J J J J * * I t • • • • • k ■ * . * • . .»».. 

MCTGCAGCAATGCCTGTACCT^ 

1890 1900 1910 1920 1930 1940 1950 

1360 1370 1380 1390 1400 
inputs CTCCA C- - - -GCCCCCGCTTCCTCCAGGCCTGCC-TCCTGGTCACTACGACT- -C CC 

CACCAGGGTrCAGAGGCCCCTCctGCCAGAGGCCCfGCCCGCCtGGfc 
1960 1970 1980 1990 2000 2010 2020 

1410 1420 1430 1440 1450 

inputs C--CAAG- - -AACAGCCATA-TCCCTG GAC ACTATGACTTGCCT - -C- - -CAGTAC- 

CTGCAAGTGCMCAACCAftCTTCCtGCCACCCGfCGGATGGGA^ 
2030 2040 2050 2060 2070 2080 2090 

1460 1470 1480 
inputs GGC- - -ATC--CTC CAT- - -CCCCT- -CCA - TCCCGGC- - -GCCAG-GAC 

GGCCCTGACTGCtCTGMTCAtGTCCCCCAGGCCACTGGGGACTC 

2100 2110 2120 2130 2140 2150 2160 

1490 1500 1510 1520 1530 1540 
inputs CGC - TGAAGA - GCCGGCAT - - - - - GGTATGGGAGC - GTGCCTATGTACCTTGC - - • -CAGGA G 

ATCAtGGTGCCACCTGCCACCCCCAGG^ 
2170 2180 2190 2200 2210 2220 2230 

1550 1560 1570 1580 
inputs CAGGGACTG - - GACCAGCAGG CCACG- - - - - -AACAGAAACA CTTGGTGAA 

ciCGGMGGCTGCCCATGMGM^ 
2240 2250 2260 2270 2280 2290 2300 

15901600 1610 1620 1630 
i nputs GTGAAC- - - - -AGAGACGGACTGTGGC - CCTGTGCTTC- - - -CACCGAGGGAGACACT- - - - AGTTGACA 

» » * *••«••• ••■ ••••• • • • » • ■*•«»« 

AtGTGCCACCCA&GACTGGGGm^ 

2310 2320 2330 2340 2350 2360 2370 

1640 1650 1660 1670 1680 1690 

i nputs - - -AAGTGTCTAAC - CCTCTTTTCCAACC - CAC - - -TGCTC - - - AAGTCCCTGTGGAC - - - - ATAAGC - - 

GCCAGGAGTCCttCACCATAAtGCCCA^ 
2380 2390 2400 2410 2420 2430 2440 
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1700 1710 1720 1730 174 0 

inputs TGGTGGGCAGAA TGTTGTTGTACMGTG - - ■ -TGATTTTAG- - -ATCG AI 1 1 1 1 1 1 1 IA AAGT- 

tGCAGTGCTGGGGACCCT^^ 
2450 2460 2470 2480 2490 2500 2510 

1750 1760 1770 1780 1790 1800 1810 

inputs ATGTGTTGGGTAC-CTTTTCTGTG-TGTATGCTCAGGCAGGCTGTGTGTGTCTCTAGTTGGCTTTAGAG 

MGGAACATGAGCACfTGGCAGtGGCTACAGCACTC^^ 
2520 2530 2540 2550 2560 2570 2580 

1820 1830 1840 1850 1860 1870 

inputs GGAGTC AGGTATAGGTTCTGCCTT - -CTGCACT- - • TTCCA - TCT - TATCT - AGTAGTCAGCTT 

• ••■•** • a a « • a a « »••* • a • • • » a a mm * a 

GAtGTCTCTCCGAGCTACA^^ 

2590 2600 2610 2620 2630 2640 2650 

1880 1890 1900 1910 1920 

inputs - CCAAGCTTAACTAGTTAGAGCTCCA ■ -C- - -CAGCAG- - - - - CAG - GCCCTAACTAC - - - -CTGCCTGC 

ACCCfcCACCCCCTAACAAGATtcCAGGCAGTCAGCTGTTTGTCAGCTC^ 

2660 2670 2680 • 2690 2700 2710 2720 

1930 1940 1950 1960 1970 

inputs CCTTCACC C - AGTAATCCTC - CATGTCTTTGCTCAGA- GGATTGCTCC - CCGA - ■ - -CTCT- - - - 

CAGAAACCATGGGCGAGATMCCACGCCACA^ 

2730 2740 2750 2760 2770 2780 2790 

1980 1990 2000 2010 2020 
inputs GGTGTTGTCCTCCTG - - ■ -GTACGCCTTGAC- • ■ -GGTCCTGCAGT--CT- - - -CC-C TTTCCCG 

>•■■•••• ► »•«••*» * a a • • a a • » • • * * a « a • 

AGAGCmCCtCAGGCACCAGCCACCtG^ 

2800 2810 2820 2830 2840 2850 2860 

2030 2040 2050 2060 2070 2080 

inputs T- - - CTTGCT-TCATT CTTTCCCAGAATGAAGGCTGTCTGCCACCCTACT-TCCCAGCCCAGGA 

GGGCCATTCTGTCATAMGGTCCCATCTCT^ 

2870 2880 2890 2900 2910 2920 2930 

2090 2100 2110 2120 2130 2140 

i nputs A- • - • - - -TTGGCA- -CATCTAAGTTCAGCC • • - - - TTCCTAAGTTACCCGTTGAGTCCTGCTTGCCCTT 

AGAACCCCTATGCGACCATCCGA&C^ 

2940 2950 2960 ' 2970 2980 2990 3000 

2150 2160 2170 2180 2190 2200 

inputs CACATAT- - - - -TCCA-CAGAA-CACCCACC- - - - - CCAC ATCTGCTTCATAGCTACTCTCTTCTCC AC 

GATGAAAGGCCCfCCATCAGfGTCfcCCCCCAGGCAGCCfciTCA 

3010 3020 3030 3040 3050 3060 3070 
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2210 2220 2230 2240 2250 2260 

inputs GTACCCACAGMGGCAGMGTGGTACCAGGCMGMGATGGGA- - ■ TTGTTGCATTTTGTTTTGTTTTTG 

CTGCAGtcfcAGAGAGACAGCGGCACCtAT-GAGCAGCC^ 

3080 3090 3100 3110 3120 3130 3140 

2270 2280 2290 2300 2310 2320 2330 

inputs AGACTCTGT - CTCACTATGTAGTCCTGGCTGGCCTG - -GAACTCAAGAGCTCTGCCTGCCTCTGCCTCTT 

GG-CfcCAfGCCCCCfct-fccfcCGGGCCTGCCACCCGGCCAC 

3150 3160 3170 3180 3190 3200 3210 

2340 2350 2360 2370 2380 

1 nputs - • • - GAGTGCTGGGTTTA ACGGCT- ■ CAGGGTCACATGCA- - - CAGCTC AAGCTGCACT - - 

CCCTGGACACtATGACtTGCCTCCAGTACGGCATCCTCCATCACCTC 

3220 3230 3240 3250 3260 3270 3280 

2390 2400 2410 2420 

i nputs - - - -CCGA TGTGCTT- - - -TCCC- - CTGTTGCTAGATTAGCGTCTGCCTCCC - - - - 

GGAGCCAGCATGGTATGGGAGACTGCCTGTGMCCCTGCCA 

3290 3300 3310 3320 3330 3340 3350 

2430 2440 2450 2460 2470 

inputs CCTAGTGGAG AGGCTGA- - -TCGC-CAGCT- - CTCTGATGCAGGACTCTGGT - • 

ATAGACATACnGGTGMGTGMCGGAGA^ 

3360 3370 3380 3390 3400 3410 

2480 2490 2500 2510 
inputs GTTTAGGCTCA- - • CTCACTATTGGTTTCCTTGGCACAGG GTAGTCA- - - -CT 

GCAAAGTGtCTMCCtcCCttTtcCAGCCCAiTGCtc 
3420 3430 3440 3450 3460 3470 3480 

2520 2530 2540 2550 2560 

i nputs CAA TAAATGTTCC --TCT - - - - AAMGCTGAAAAAAAAAAAAAAAAAGG 

WGMTGnGHGiTGMGTC^ 
3490 3500 3510 3520 3530 3540 3550 

inputs GCGGCCGC 

GCGGCCGC 
3560 
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10 20 30 40 50 60 70 

i nputs MAPARAGFCPLLLLLLLGLWVAEI PVSAKPKGMTSSQWFKIQHMQPSPQACNSAMKNINKHTKRCKDLNT 

MV LCFPLLLLLLVLWGPVCPLHAWPKRLTKAW 

10 20 30 40 50 60 

80 90 100 110 120 130 140 

inputs FLHEPFSSVAATCQTPKIACKNGDKNCHQSHGPVSLTMCKLTSGKYPNCRYKEKRQNKSYVVACKPPQKK 

FLHDSFQNVMVCDLLSIVCKNRRHNCHQSSKPVNMTDCRLTSGKYPQCRYSMAQYKFFIW 
70 80 90 100 110 120 130 

150 

inputs DSQQFHLVPVHLDRVL 

DPP - YKLMVPVHLDSI L 
140 150 
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10 20 30 40 50 60 70 

inputs GTCGACCCACGCGTCCGGCTCCCA6CCCACCCCCAAACAGACACAGCGTAGCCCGGGCCAGCTCTTAAGG 

At GG 

80 90 100 110 120 130 140 

inputs AG1TCAGGAGTGAGMGAGGCCCTCAGAGATCTGACAGCCTAGGAGTGCGTGGACACCACCTCAGCCCAC 

TG - fit A- ■ ■ -tGCTf - - - -tcct^rtCt 

10 20 

150 160 170 180 190 200 210 

inputs TGAGCAGGAGTCACAGCACGMGACCMGCGCAMGCGACCCCTGCCCTCCATCCTGACTGCTCCTCCTA 

tTtACtG CTGC tGGTt -tfk 

30 40 

220 230 240 250 260 _ 270 280 

inputs AGAGAGATGGCACCGGCCAGAGCAGGATTCTGCCCCCTTCTGCTGCTTCTGCTGCTGGGGCT 

tGGG GACCA6tG- - tGTCCACttcA- -tGCft 6GC 

50 60 70 

290 300 310 320. 330 340 350 

inputs CAGAGATCCCAGTCAGTGCCAAGCCCAAGGGCATGACCTCATCACAGTGGTTTAAA^ 

CtAAG- • - -C-GtCt- - ■ -CA- - -CcAAGG-C -tcAc- - tGGTtf'GMAtt CAGCATAtAcA 

•80 90 100 110 

360 370 380 390 400 410 420 

inputs GCCCAGCCCTCMGCATGCMCTCAGCCATGAAAMCATTMCMGCACACAAAACGGTGCAM 

GcdAAGT6dtct iih AtgcA AcAgggcaatgA- - - ■ 

120 130 140 150 

430 440 450 460 • 470 480 490 

inputs mcaccttcctgcaggagcctttctccagtgtggccgccacctgccagacccccaaaatagcctgcaaga 

■ GtG6CATCAAC - - • AAttAtGCC 

160 170 

500 510 520 530 540 550 560 

inputs ATGGCGATAAAMCTGCCACCAGAGCCACGGGCCCGTGTCCCTGACCATGTGTAAGCTCACCTCAGGGAA 

cAg- - -cAc- - tGtAAGcA- - - - tcA- - - -A 

180 



570 580 590 600 610 620 630 

inputs GTATCCGAACTGCAGGTACAAAGAGMGCGACAGMCMGTCTTACGTAGTGGCCTGTMGCCTCCCCAG 

MtACCTttcf GCATG-AC TCtttC CAG 

190 200 210 

640 650 660 670 680 690 700 

inputs AAAAAGGACTCTCAGCMnCCACCTGGTTCCTGTACACnGGACAGAGTCCTTTAGGTTT 

AAtGtGG CtGttGt CtGt GAlitGCf ■ -CAG- 

220 230 240 

FIG.37A 
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710 720 730 740 750 760 770 

Inputs CnGCTCTTTGGCTGACCTTCMTTC 

CAttGTCtG- -CM-AAATC -GTCG- -GCACAACTGCCA CCAgAGC- - - - - 

250 260 270 280 

780 790 800 810 820 830 840 

inputs CnCCCCTCATCTCTTGGGGCTGTTCCTGGTTCAGCCTCTGCTGGGAGGCTGAAGCTGACACTCTGGTGA 

tCAAAG CCTG- -TCAACAt-GACt- • - -GACTG- - -CAGACTCAct- 

290 300 310 320 

850 860 870 880 890 900 910 

inputs GCTGAGCTCTAGAGGGATGGCTTTTCATCTTTTTGTTGCTGTTTTCGCAGATGCTTAT^ 

tCAGGiAAfi ■ tAtCCCCAG 

330 

920 930 940 950 960 970 980 

inputs GCAAGCTCAGGTCTGTGGGTTCCCTGGTCTATGCCATTGCACATGTCTCCCCTGCCCCCTGGCATTAGGG 



■-. --TGCC - GCTATAGTG 

340 350 

990 1000 1010 1020 1030 1040 1050 
inputs CAGCATGACMGGAGAGGAMTAMTGGMAGGGGGCATATGGGATTTGTGGACACAGCTGTTTCTGTTC 

CTGCT GC - C 

360 

1060 1070 1080 1090 1100 1110 1120 
i nputs CTGAACTAGMGTC17CCCCAGCTCTGACGTGGCAGTGAGGTGACCTGMGGAMGAAAMTATAAATAA 

CAGtAcAAAt- - tcttc - - - - - - - - AtfG 

370 

1130 1140 1150 1160 1170 1180 1190 
inputs ATACCACTTCATATTTGTATAGMTCCTCTMTCCCTTGTGACATAGACTTGACAGGGATTGTATGCCTT 

tfGCCt GtGAc6 CCC Ct- - -6AG - 

380 390 

, 1200 1210 1220 1230 1240 1250 1260 

1 nputs CTrTATGGATGAM^TTMGGTTTTAGAMGCnMTGMnAMGAGCTTGTCTMTTAGTTAGTAG 



--AAGAGC 

400 

1270 1280 1290 1300 1310 1320 1330 

inputs CAGAACCTGGACTTGMCCTAGGTCTCCTTGCTCTAMTACAGTGTACCTTCTACTCTACCAGTTGCGCA 

- ■ -GACC- • • - - CC CC- - - ■ • - -CtACAAGttG- - • - 

410 420 

1340 1350 1360 1370 1380 1390 1400 

i nputs AGAAAGAAGTCACTGTTACAGAGGCAAGCGGTGAACTAGGTAAGAGTTCACTCATGAAGAAACGAGTGCT 



-GTtC-CTGt-ACA CtfAGAtAGtAttcfct- 
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i nputs CTGMGA6CCAG™CCCT GTGTTGGCTGCMTAMGgISa™ 



1480 1490 
inputs AAAAAAAAAAAAAAAAAAAAAAAAAAA 

- AA 
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240 250 260 270 280 290 300 

AGGATTCTGCCCCCTTCTGCTO 

GGTGCTATGCm&T^ 

10 20 30 40 50 60 70 

310 320 330 340 350 360 370 

CCCMGGGCATGACCTCATCACAG^ 

CCTMGCGTCM&MGG^ 

80 90 100 110 120 130 140 

380 390 400 410 420 430 440 

CAGCCATGAAAMCATTMCMGCACACAAAACGGTGCAM 

GGG6AAtGAGtGG6AtcM^ 

150 160 170 180 190 200 210 

450 460 470 480 490 500 510 

CTCCAGTGTGGCCGCCACCTGCCAGACCCCCAAMTAGCCTGCAAGAAT - GGCGATAAAAACTGCCACCA 

CCAGAATGTGGCTGCTGT^ 

220 230 240 250 260 270 280 

520 530 540 550 560 570 580 

GAGCCACGGGCCCGTGTCCCTGACCATGTGTAAGCTCACCTCAGGGM 

GAGCTCMAGCCTGt^ 

590 600 610 620 630 640 ' 650 

G-AGAAGCGACAGAACAAGTCTTACGTAGTGGCCTGTAAGCCTCCCCAGAAAAAGGACTCTCAGCMTTC 

GCTGCTGC - CCAGtAC AMfTCtf CAftGtTGCCtGtGACC - C - • tAc 

360 370 380 390 400 410 

660 670 680 

CACCTGGTTCCTGTACACTTGGACAGAGTCCTITAG 

AAGTTGGttdCTGTACAC^ 

420 430 440 450 



43. 4# Identity in 477 aa overlap; score: 746 

410 420 430 440 450 460 

GGTGCAAAG- - -ACCTCAACACCTTC- - CTGCACGAGCCTTTC - - TCCAGTGTGGCCGCCACCTGCCAGA 

GGt&TAjGCTrta^^ 



20 30 40 50 60 70 

FIG.38A 



^BNSDOCID: <WO O100673A1JB> ..... . \ • •ma»^W-,.„-jjtw ) W--3' 



SUBSTITUTE SHEET (RULE 26) 



88/95 

470 480 490 ' 500 510 520 530 

CC CCCAAAATAGCCTGCMGMTGGCGATAM-MCTGCCACCAGAGCCAC6GGCCCGTGTCC 

CCTMGCGTCTCACCAAG^ 

540_ 550 560 570 580 , 590 600 

CTGACCATGTGTMGCTCACCTC^ 

CAGGGCMTGAGtGGCA-tCAACAATTAtG- - -CCCAGCACTGTMGCAtCAAMtACClTrctGCAtGA 
150 160 170 180 190 200 

C™GTAGTGGCCTG?fc 

Ct- -CTtt- - - -CCAGAATGTGGCTGCtGt^ 

210 220 230 240 250 260 270 

670 680 ' 690 700 710 720 730 

TTGGACAGAGTCCTTTAGGTTTCCAGACTGGCTTGCTCTTTGGCT - 

A- - -ACTG- - -CCAccAgAgCTCMAGC- - -CtGTCMCAtGAct<^ 



290 300 310 320 

740 750 760 770- 780 790 

- - - -CTCC-GCACCACTCCC- - -CTACA-CCCAGAGCATTCTCTTCCCCTCATCTCTTGGGGC 

^GTATCCCCAjjGCC^ 



800 810 820 830 840 850 

TG - - GTTCAGCCTCTGCTGGGAGGCTGMGCTGACACTCTGGTGAGCT^ 

AGMGAGdGAC(ic6cCdtACAAGTTGGiTCC*i"4t - ACAdtTAGAtAGtAtTdtdtAA 
400 410 420 430 440 450 



46. 5# identity in 488 aa overlap; score: 709 

tgcac6a|ccY[tct<^ -cca^SccccaaaatXgcC" -tgcaaIaatggc 

TGCt^tGCTttCCTdTTC^ 

10 20 30 40 50 60 70 

500 510 520 530 540 550 560 

GATAAAAACTGCCACCAGAGC - CACGGGCCCGTGTCCCTGACCATGTGTMGCTCA-CCTCAGGGAAGTA 

CtAAGCGtCt- - CAdtMGGfiTCAdTGGTTTGAMTTfiAG - - 6AtAtAcAGC(iAAGTdct6 

80 90 100 110 120 130 

570 580 590 600 610 620 630 

TCCGAA-CTGCAGGTACAMGAGMGCGACAGAACAAGTCTTACGTAGTGGCCTGTMGCCT^ 

tCCAAtGCAAcAGG-GCAAtGAGtGGCAtC- -McAATt-AfGCCCAGCA- -6*fGtAAGCAt6 A 

140 150 160 170 180 

640 650 660 670 680 690 700 

AMGGACTCTCAGCMnCCACCTGGTTOT - CAGACTGGC 

AMTACCtTtCtGCAtGAdT - -Ct- -tfcdAGAA- - -tGTGGCtGCTGTCtGtGATtfGCTCAGCAiTGT 
190 200 210 220 230 240 250 
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laminin_EGF: domain 1 of 4, from 3 to 37: score -1.2, E = 0.59 

*• >CdCnphGs1 sddtCdsddel f geetGqCI kCkpnvtGrrCdr . CkpG 
+ G d+ ++GqC+ C+ + +G+rC +C +G 

mT272 3 ---HASG DP VHGQCR - CQAGWMGTRCHLpCPEG 31 

yyglpsgdpgqgC<-* 
++g + +G 
mT272 32 FWG A-NC 37 



EGF: domain 1 of 4, from 37 to 67: score 19.2, E « 0.1 

*- >CapnnpCsngStC vntpggssdnf ggytCeCppGdyyl sytGkrC< - 
C+ ++ C+ngGtCv+ g C+C+pG + G+ C 

mT272 37 CSNTCTCKNGGTC VSENG NCVCAPG ----- FRGPSC 67 



DSL: domain 1 of 1, from 10 to 67: score -21.2, E - 8.1 

*- >Wstdkhi ggrtsl Gf nl eyri rvtCdenYYGsgCnkFCrPrdDaf gH 
+ ++ +p+ Ce G+ C++ c +g+ 
mT272 10 - -HGQCRCQAG WMGTRCHLPCPEGFWGANCSNTCTCK- - -NGG 47 

ytCdenGnkl CI eGWkGeyC<- * 
+enGn C++G +G+ C 
mT272 48 TCVSENGNCVCAPGFRGPSC 67 



1aminin_EGF: domain 2 of 4, from 41 to 80: score -1.5, E -. 0.63 

*• >CdCnphGsl sddtCdsddel f geetGqCl kCkpnvtGrrCdr . CkpG 
C+C + G tC s e G C+ C p++ G+ C r+C pG 

mT272 41 CTCKNGG TCVS ENGNCV-CAPGFRGPSCQRpCPPG 74 

yyglpsgdpgqgC<-* 
y + + c 

mT272 75 RY-----GKR--C 80 



EGF: domain 2 of 4, from 80 to 110: score 11.8, E - 1.9' 

*->CapnnpCsng . GtCvntpggssdnf ggytCeCppGdyyl sytGkrC< 
C + C+n++ C+++ g Tc C G +tG++C 
MT272 80 CVQC-KCNNNhSSCHPSDG TCSCLAG VfTGPDC 110 

FIG.39A 
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lamin1n_EGF: domain 3 of 4, from 83 to 123: score 25.6, E « 0.0012 

*->CdCnphGal sddtCdsddel fgeetGqCTkCkpnvtGrrC . drCkpG 
C Cn++ ++C++ +G C+ C+ + tG++C++ C pG 

mT272 83 CKCNNNH SSCHP SDGTCS-CLAGWTGPDCsEACPPG 117 

yyg!psgdpgqgC<-* 
++g1 C 
inT272 118 HWGL KC 123 



EGF: domain 3 of 4, from 123 to 153: score 27.3, E = 0*00036 

*~>CapnnpCsngGtCvntpggssdnfggytCeCppGdyy1 sytGkrC<- 
C++++ C++gGtc++ g +C+C+pG +tG++c 
mT272 123 CSQLCQCHHGGTCHPQDG - -SCICTPG WTGPNC 153 



1aminin_EGF: domain 4 of 4, from 127 to 172: score -5.5, E = 1.4 

*->CdCnphG$1 sddtCdsddel fgeetGqCI kCkpnvtGrrC . drCkpG 
C+C++ G tC++ G C C p+ tG++C + C p 

mT272 127 CQCHHQG TCHP- QDGSCI-CTPGWTGPNC1EGCPPR 160 

yyg1psg.dpgqgC<-* 
+g +++H- + +C 
mT272 161 MFG-VNCsQLC-QC 172 



EGF: domain 4 of 4, from 166 to 196: score 4.5. E = 5.8 

*- >CapnnpCsngGtCvntpggssdnfggytCeCppGdyy1 sytGkrC< - 
C++++ C+ g C++ g C+CppG +G +C 
mT272 166 CSQLCQCDLGEMCHPETG ACVCPPG HSGADC 196 



FIG.39B 
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ratT272 



92/95 

*->CaPnnpCsnqGtCvntpggssdntggytCeCppGayyl sytGkrC<- 
C++++ C+ngG C g +C+C+pG y+G+rC 
18 ! ECRCHNGGLCDRFTG QCHCAPG YIGDPRC 



48 



lam1nin_EGF: domain 1 of 11, from 22 to 61: score 12.3, E - 0.038 

*->CdCnphGsl sddtCdsddel fgeetGqCl kCkpnvtGrrC . drCkpG 
C C++ G Cd+ +tGqC+ C p++ G+rC+++C G 

ratT272 22 CRCHNGG LCDR- ------ FTGQCH - CAPGYIGDRCrEECPVG 

yyglpsgdpgqgC<-* 
+g q+C 
ratT272 56 RFG QDC 61 



55 



EGF: domain 2 of 11, from 61 to 91: score 18.3, E « 0.18 



ratT272 



*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyyl sytGkrC<- 
Ca+++ C q++C + q C C +G +tG+rC 
61 CAETCDCAPGARCFPANG ACLCEHG FTGDRC 



91 



lamininJEGF: domain 2 of 11, from 65 to 105: score 4.0, E « 0.2 

*- >CdCnphGsl sddtCdsddel fgeetGqCl kCkpnvtGrrCdr . . Ckp 
CdC p + +C + G+Cl C +++tG+rC ++ C + 

ratT272 65 CDCAPGA RCFP .- • - ANGACL - CEHGFTGDRCTE r 1 CPD 98 

GyyglpsgdpgqgC<-* 
G ygl +C 
ratT272 99 GRYGL -SC 105 



EGF: domain 3 of 11, from 105 to 137: score 4.1, E - 9.6 



ratT272 



*->CapnnpCsng . .GtCvritpggssdnfggytCeCppGdyyl sytGkrC 
C-h-h- C+ ++ C++ +g +C C+pG ++G +C 
105 CQDPCTCDPEhsLSCHPMHG ECSCQPG WAGLHC 137 



lamin1n_EGF: domain 3 of 11, from 109 to 150: score 13.1, E - 0,032 

*->CdCnphGsl sddtCdsddel fgeetGqCl kCkpnvtGrrCdr . CkpG 
C+C+p sis C++ ++G+C+ C+p ++G +C+++C 
ratT272 109 CTCDPEHSLS- - -CHP MHGECS - CQPGWAGLHCNEsCP - - 

yyglpsgdpqqgC<-* 
++ + g gC 
ratT272 143 --QD---THGAGC 150 



FIG.41A 
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EGF: domain 4 of 11, from 150 to 180: score 27.7, E - 0.00026 

*- >CapnnpCsngGtCvntpggssdnf ggytCeCppGdyyl sytGkrC<- 
C++++ C++gG+C+ g C+C+pG yt&H-C 
ratT272 150 CQEHCLCLHGGVCLADSG -LCRCAPG- YTGPHC 180 



lamininJGF: domain 4 of 11, from 154 to 193: score 8.4, E » 0084 

*->CdCnphGsl sddtCdsddel fgeetGqCI kCkpnvtGrrC . drCkpG 
C C +hg + C +GC+C p++tG++C + C p+ 

ratT272 154 CLC-LHG- - - -GVCLA DSGLCR - CAPGYTGPHaNLCPPN 187 

£yg1psgdpgqgC<-* 

ratT272 188 TYG1 NC 193 



EGF: domain 5 of 11, from 193 to 223: score 10.6, E - 2.5 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyyl sytGkrC<- 

Q | | | ,| - Q ^ Q | | . g tC { Q H»Q [ I 

ratT272 193 CSSHCSCENAIACSPVDG- --TCICKEG WQRGNC 223 



laminin^EGF: domain 5 of 11, from 197 to 236: score 0.7, E « 0.4 

**>CdCnphGsl sddtCdsddel fgeetGqCI kCkpnvtGrrCdr.CkpG . 
q q ^ q + + G C Ck++ + +C +C pG 

ratT272 197 CSCENAI ACSP VDGTCI-CKEGWQRGNCSVpCPPG 230 



yyglpsgdpgqgC<-* 
++g+ +C 
ratT272 231 TWG SC 236 



EGF: domain 6 of 11, from 236 to 266: score 11.8, E = 1.9 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C+ + C + G+C + g C+C+pG + G +C 
ratT272 236 CNASCQCAHEGVCSPQTG ACTCTPG WRGVHC 266 



lamin1n_EGF: domain 6 of 11, from 240 to 279: score -2.2. E - 0.73 

*->CdCnphGs1sddtCdsddelfgeetGqCl kCkpnvtGrrCdr.CkpG 
C+C + G C + tG+C C p+ G+C +C G 

ratT272 240 CQCAHEG VCSP - -QTGACT-CTPGWRGVHCQLpCPKG 273 

yyglpsgdpgqgC<-* 
+g +gC 
ratT272 274 QFG EGC 279 



FIG.41B 
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DSL: domain 1 of I, from 246 to 309: score -19.4, E - 5.2 

* ->Wstdkh1 ggrtsl Gf nl eyr 1 rvtCdenYYGegCnkFCrPrdDaf gH 

4. .{.,(..j..|.g4. -|- -H-t* Q +GeQC+ C+ H 

ratT272 246 GVCSPQTGACTCTPGQRGVHCQLPCPKGQFGEGCASVCDCD H 287 

yt . Cd . enGnkl cl eGWk6eyC<-* 
+ +Cd+ +G +C +GW+G C 
ratT272 288 SDgCDpVHGHCRCQAGWMGTRC 309 



EGF: domain 7 of 11, from 279 to 309: score 7.0. E « 5.3 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
Ca+ + C++ C +++g +00 G + G rC 
ratT272 279 CASVCDCDHSDGCDPVHG HCRCQAG WMGTRC 309 



TamininJEGF: domain 7 of 11, from 283 to 322: score 12.7, E « 0.035 

*- >CdCnphGs1 sddtCdsddel f geetGqCl kCkpnvtGr rCdr . CkpG 
CdO h+ d Cd+ ++G+C+ O + +G+rC +C +G 

' ratT272 283 CDCD-HS DGCDP VHGHCR-CQAGQMGTRCHLpCPEG 316 

yyglpsgdpgqgC<-* 

ratT272 317 FW§ A-NC 322 



EGF: domain 8 of 11, from 322 to 352: score 17.3, E « 0.38 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C+ + C+ngGtCv+ g C+C+pG + G+ C 
ratT272 322 CSNACTCKNGGTCVPENG NCVCAPG FRGPSC . 352 



lamlnlnJEGF: domain 8 of 11, from 326 to 365: score -1.8, E - 0.67 

*->CdCnphGsl sddtCdsddel f geetGqCl kCkpnvtGrrCdr . CkpG 
C+C + G tC + e G C+ C p++ G+ C r+C pG 

ratT272 326 CTCKNGG TCVP ENGNCV-CAPGFRGPSCQRpCPPG 359 



yygipsgdpgqgC<-* 
y ■+ + c 
RY- 



ratT272 360 RY - - - - -GKR-C 365 



EGF: domain 9 of 11, from 365 to 394: score 18.3, E = 0.18 

*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C p C+n+ C+++ g tC C G +tG++C 
ratT272 365 CVPC-KCNNHSSCHPSDG- TCSCLAG WTGPDC 394 



FIG.41C 
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laminin_EGF: domain 9 of 11, from 368 to 407: score 24.0, E = 0.0034 



ratT272 368 



ratT272 



*->CdCnphGsl sddtCdsddel fgeetGqCI kCkpnvtGrrC . drCkpG 
C Cn+h+ +C++ +GC++ + tG++C++ c pG 

CKCNNHS- 



+C++ +GC++ + tG++C++ C pG 

-SCHP SDGTCS-CLAGWTGPDCsESCPPG 401 



yyg1psgdpgqgC<-* 
402 HWGL KC 407 



EGF: domain 10 of 11, from 407 to 437: score 24.0, E « 0.035 



ratT272 



*->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- 
C++++ C++g+tC++ g +C+C pG +tG++C 

407 CSQPCQCHHGATCHPQDG -SCVCIPG- - - - -WTGPNC 437 



lamlnlnEGF: domain 10 of 11, from 407 to 437: score 6.5, E - 0.12 

*->CdCnphGsl sddtCdsddel fgeetGqCI kCkpnvtGrrCdrCkpGy 
C+C++ + tC++ G C+ C p+ tG++C + 

ratT272 411 CQCHHGA TCHP-- QDGSCV-CIPGWTGPNCSE- - - - - 439 

yg1psgdpgqgC<-* 

g pg' | 1 1'0 l I Q 

ratT272 440 -GCPSRMFGVNC 450 



EGF: domain 11 of 11, from 450 to 480: score 8.7, E ^ 3.7 



ratT272 



* - >CapnnpCsngGtC vnt pggs sdnf ggy tCeCppGdyyl sytGkrC< - 
C++++ c+-g C++ g C+CppG +G +C 
450 CSQLCQCDPGEMCHPETG ACVCPPG HSGAHC 480 



lam1n1n_EGF: domain 11 of 11, from 454 to 489: score -6.3, E * 1.7 

*->CdCnphGsl sddtCdsddel fgeetGqCI kCkpnvtGrrCdrCkpGy 
C+C+p G + C++ etG+C+ C p+ +G +C 
ratT272 454 CQCDP-G- - - -EMCHP ETGACV - CPPGHSGAHC K 481 

yglpsgdpgqgC<-* 

ratT272 482 v8sQE-SFT--- 489 



FIG.41D 
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SEQUENCE LISTING 



<110> Millennium Pharmaceuticals/ Inc. 

<120> MEMBRANE -ASSOCIATED AND SECRETED PROTEINS AND USES THEREOF 

<130> 7853-206-228 

<150> 09/345/464 
<151> 1999-05-30 

<160> 148 

<170> FastSEQ for Windows Version 3\0 

<210> 1 

<211> 3284 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1222) . . . (1944) 
<400> 1 

gtcgacccac gcgtccgtta tgtaactata cattttccca gaaattttag tatatgatat 60 

gattttgttt tctttcatcc cttttcccaa gcagtttatt atgaaaattt tcaaacatac 120 

agcaatgttg agaaaatttt acagtaaatg cctataccca ttacctaaat tttaccatta 180 

acattttacc ctgctggcat tattgtgctt atccatctac gtatccctct ctcccttcat 240 

tggtgtattt ctaagtaaat tgtaggcctc agtacacttc cttctgaatt cttcagcatg 300 

cacaacagta ttatattcca tttttaaaag agcaattctt gatagattta tatagttttg 3 60 

taaaatgttc atatagagct acaaatttta tctttttgtt tcttattgta tgtctagggt 420 

cctgaagggg atgctggcat tgttgggata tcaggtccta- aaggtcctat tggacacaga 480 

ggaaacactg gtccccttgg cagagaaggt ataataggcc caacaggtag aactggaccc 540 

agaggtgaaa agggctttag aggtgaaact ggtcctcaag gaccaagagg tcaaccaggg 600 

cctccaggtc cacctggagc accaggccca agaaagcaaa tggatatcaa tgctgctatt 660 

caagccttga ttgaatcaaa tactgcccta cagatggagg taacatatct ggttttattt 720 

atattggcac tgtctctcaa tataccaatt aaacagagaa aatttttgga ggccaaaatg 780 

tgacattatc tcaaagattg tatttaaaac agattgaaaa tgtgaaacca ttctcaagaa 840 

caaagtaagt gattttggta taattaaaca gaaatatatg cgtaggatgt tttgtaagga 900 

aaacatttaa atcaaaaatt tagtacfcgtt atttgtaagg aatttggtac tatccaagaa 960 

agtagttaaa tgaggttagc catgtttctt aaaatgagat atatatatta tcactactca 1020 

tttatttaaa ctctaatgat tcaatgtgta atttaaaaaa cataatacag tagacatagc 1080 

aattcttatg ttagcttgaa aactaaactt gcaaatgtga atttaacctc tttaaaagat 1140 

taaggttatt aaagcataca catatgccta tgcttaaata taaactgttc .tttacattct 12 00 

actcacaact tactacacat a atg gaa aca cat tct tct cct gcc ttg gcc 1251 

Met Glu Thr His Ser Ser Pro Ala Leu Ala 
15 10 

cat gtt ggt cct cag gat ttt ttt gtt tat ata att ctt atg atg act 1299 
His Val Gly Pro Gin Asp Phe Phe Val Tyr He lie Leu Met Met Thr 
15 20 25 

tgg cag age tac cag aat act gaa gtg act tta att gac cac agt gaa ■ 1347 
Trp Gin Ser Tyr Gin Asn Thr Glu Val Thr Leu lie Asp His Ser Glu 
30 35 40 



1 
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gag ata ttc aaa acc ctg aac tac ctt. age aat tta ttg cac age ate 1395 
Glu lie Phe Lys Thr Leu Asn Tyr Leu Ser Asn Leu Leu His Ser He 
45 50 55 

aag aat cct ctt ggc aca cga gat aac cca gca cga ate tgc aaa gat 1443 
Lys Asn Pro Leu G.ly Thr Arg Asp Asn Pro Ala Arg He Cys Lys Asp 
60 65 70 

tta ctt aac tgt gaa caa aaa gta tea gat gga aaa tac tgg att gac 1491 
Leu Leu Asn Cys Glu Gin Lys Val Ser Asp Gly Lys Tyr Trp He Asp 
75 80 85 90 

cca aat ctt ggc tgt cct tea gat gee att gag gtt ttc tgc aat ttc 1539 
Pro Asn Leu Gly Cys Pro Ser Asp Ala He Glu Val Phe Cys Asn Phe 
95 100 105 

agt get ggt ggc cag aca tgc tta cct cct gtt tct gta aca aag ttg 1587 
Ser Ala Gly Gly Gin Thr Cys Leu pro Pro Val Ser Val Thr Lys Leu 
110 ~ 115 120 

gag ttt gga gtt ggg aaa gtc cag atg aac ttc ctt cat tta ctg agt 1635 
Glu Phe Gly Val Gly Lys Val Gin Met Asn Phe Leu His Leu Leu Ser 
125 130 135 

teg gaa gec acc cat ate ate acc att cac tgt eta aac acc cca agg 1683 
Ser Glu Ala Thr His He He Thr lie His Cys Leu Asn Thr Pro Arg 
140 145 150 

tgg aca age aca caa aca agt ggc cca gga ttg cct att ggt ttc aag 1731 
Trp Thr Ser Thr Gin Thr Ser Gly Pro Gly Leu Pro He Gly Phe Lys 
155 160 165 170 

gga tgg aat ggc cag att ttt aaa gta aac act eta ctt gaa cct aaa 1779 
Gly Trp Asn Gly Gin He . Phe Lys Val Asn Thr Leu Leu Glu Pro Lys 
175 180 185 

gtg ctt tea gat gac tgc aag att caa gat ggc age tgg cat aag gca 1827 
Val Leu Ser Asp Asp Cys Lys lie Gin Asp Gly Ser Trp His Lys Ala 
190 195 200 

aca ttt ctt ttt cac acc cag gaa cct aat caa ctt cca gtg att gaa 1875 
Thr Phe Leu Phe His Thr Gin Glu Pro Asn Gin Leu Pro Val He Glu 
205 210 215 

gta caa aaa ctt cct cat etc aaa act gaa cga aag tat tac att gac 1923 
Val Gin Lys Leu Pro His Leu Lys Thr Glu Arg Lys Tyr Tyr He Asp 
220 225 230 

age agt tct gta tgc ttt ctg taaagtctct gaattagttc cgaattcagg 1974 
Ser Ser Ser Val Cys Phe Leu 
235 240 

ctgttggcca ggtaattget gcagagggag aaataagaca gacagataca gtcattatga 2034 

aatgcatgta ataaagcatt ggctaaatct taaagaatct caggaagaac agacttcctc 2 094 

ctaagaagga gaaaaggcat ttttaaagga- ctatgattga taaagtattt aattctttta 2154 

aaaattatat tcatctcagc tttcttagag aattccctag aactaaaaat ttataaatat 2214 

ggaattcttc agggtatctt atatttttga ctgagtgcgt agtacccatt agacagctgg 22 74 

agatgeagag cactatggag caatactggc taatgettec agatgtgcac tgcttctgtc 2334 
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taaa&attac aagccacagt ctaatatgtc 
gtccccgatg ggcatataca tcttagccgg 
tgtgttgctt ggtgctcttt cgaaaacaag 
tttcatcttt gtcattcttt aaaagtgtat 
gttagtactt attttaattt gtttggtcac 
gaagtccttg ttttatttta aaattctctt 
taacctgtgc ttgtacgcaa aagaattaga 
gttgtaaaaa ttattatagg ccagctacat 
gttgtgccat actgttttta aagttcatga 
ttttgtaaag ttttaattca gcaaattttt 
ctttatattt ctgctttgta gaaattatat 
tgtacttaaa tttagtgtta gtactttaaa 
tccagaaaaa aaaaagtctt ttcccattta 
gttatcagag aaatattagt tcaatactga 
aaaagcttgt tcatccatta taaatatatc 
ctataataaa aatgtgcttt aaataaaaaa 

<210> 2 

<211> 241 

<212> PRT 

<213> Homo sapiens 



ttattttcca aaacactaag ctgtattcag 2394 

tgatacacta cctcttacgt gttgcctctt 2454 

gtgcttatgg ctttcataga ctatttcctt 2514 

gtactggtta catcaagata tgttttggtt 2574 

aoacttaata acacatgaaa ctatttatgt 2 634 

tgtgtatttg gaatcaaagc cagcacattg 2694 

tttctttgtt tttgttttat tttttaaatt 2754 

ctagtagtag gtttggggta cagattgggg 2814 

tcatctggaa tgatacttag tgtatatata 2 874 

tgaaattgct gctgttttaa attataaaac 2934 

gttttgtagt attcattgat tttctttcac 2994 

atttttaatt taccagtctt taaagcaaca 3 054 

aaataggctc agccagttca atgtcgcctt 3114 

aagaaaaata ttatacctct tggtatctag 3174 

tttagccaca gcaaaccaca cttaacctat 32 34 

aaaaaaaaaa agggcggccg 3284 



<400> 2 



Met 


Glu 


Thr 


His 


Ser 


Ser 


Pro 


Ala 


Leu 


Ala 


His 


Val 


Gly 


Pro 


Gin 


Asp 


1 








5 










10 










15 




Phe 


Phe 


Val 


Tyr 


He 


lie 


Leu 


Met 


Met 


Thr 


Trp 


Gin 


Ser 


Tyr 


Gin 


Asn 








20 










25 










30 






Thr 


Glu 


Val 


Thr 


Leu 


He 


Asp 


His 


Ser 


Glu 


Glu 


He 


Phe 


Lys 


Thr 


Leu 






35 










40 










45 








Asn 


Tyr 


Leu 


Ser 


Asn 


Leu 


Leu 


His 


Ser 


lie 


Lys 


Asn 


Pro 


Leu 


Gly 


Thr 




50 










55 










60 










Arg 


Asp 


Asn 


Pro 


Ala 


Arg 


He 


Cys 


Lys 


Asp 


Leu 


Leu 


Asn 


Cys 


Glu 


Gin 


65 










70 










75 










80 


Lys 


Val 


Ser 


Asp 


Gly 


Lys 


Tyr 


Trp 


He 


Asp 


Pro 


Asn 


Leu 


Gly 


Cys 


Pro 










85 










90 










95 




Ser 


Asp 


Ala 


He 


Glu 


Val 


Phe 


Cys 


Asn 


Phe 


Ser 


Ala 


Gly 


Gly 


Gin 


Thr 








100 










105 










110 






Cys 


Leu 


Pro 


Pro 


Val 


Ser 


Val 


Thr 


Lys 


Leu 


Glu 


Phe 


Gly 


Val 


Gly 


Lys 






115 










120 










125 








Val 


Gin 


Met 


Asn 


Phe 


Leu 


His 


Leu 


Leu 


Ser 


Ser 


Glu 


Ala 


Thr 


His 


He 




130 










135 










140 










lie 


Thr 


lie 


His 


Cys 


Leu 


Asn 


Thr 


Pro 


Arg 


Trp 


Thr 


Ser 


Thr 


Gin 


Thr 


145 










150 










155 










160 


Ser 


Gly 


Pro 


Gly Leu 


Pro 


lie 


Gly 


Phe 


L YS 


Gly 


Trp 


Asn 


Gly 


Gin 


He 










165 










170 










175 




Phe 


Lys 


Val 


Asn 


Thr 


Leu 


Leu 


Glu 


Pro 


Lys 


val 


Leu 


Ser 


Asp 


Asp 


Cys 








180 










185 










190 






Lys 


He 


Gin 


Asp 


Gly 


Ser 


Trp 


His 


Lys 


Ala 


Thr 


Phe 


Leu 


Phe 


His' 


Thr 






195 










200 










205 








Gin 


Glu 


Pro 


Asn 


Gin 


Leu 


Pro 


Val 


He 


Glu 


Val 


Gin 


Lys 


Leu 


Pro 


His 




210 










215 










220 










Leu 


Lys 


Thr 


Glu Arg 


Lys 


Tyr 


Tyr 


He 


Asp 


Ser 


Ser. 


Ser 


Val 


Cys 


Phe 


225 










230 










235 










240 



Leu 

<210> 3 
<211> 723 

3 



BNSDOCID: <WO _0100673A1 JB> 



<212> DNA 

<213> Homo sapiens 

<400> 3 

atggaaacac attcttctcc tgccttggcc catgttggtc ctcaggattt ttttgtttat 60 

ataattctta tgatgacttg gcagagctac cagaatactg aagtgacttt aattgaccac 120 

agtgaagaga tattcaaaac cctgaactac cttagcaatt tattgcacag catcaagaat 180 

cctcttggca cacgagataa cccagcacga atctgcaaag atttacttaa ctgtgaacaa 240 

aaagtatcag atggaaaata ctggattgac ccaaatcttg gctgtccttc agatgccatt 3 00 

gaggttttct gcaatttcag tgctggtggc cagacatgct tacctcctgt ttctgtaaca 360 

aagttggagt ttggagttgg gaaagtccag atgaacttcc ttcatttact gagttcggaa 420 

gccacccata tcatcaccat tcactgtcta aacaccccaa ggtggacaag cacacaaaca 480 

agtggcccag gattgcctat tggtttcaag ggatggaatg gccagatttt taaagtaaac 540 

actctacttg aacctaaagt gctttcagat gactgcaaga ttcaagatgg cagctggcat 600 

aaggcaacat ttctttttca cacccaggaa cctaatcaac ttccagtgat tgaagtacaa 660 

aaacttcctc atctcaaaac tgaacgaaag tattacattg acagcagttc tgtatgcttt 720 

ctg . ^ 723 



<210> 4 

<211> 3169 

<2X2> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

*222> (57) . .'. (1568) 



<400> 4 

gtcgacccac gcgtccgcgc ccgctgagcc ccccgccgag gtccggacag gccgag atg 59 

Met 
1 

acg ccg age ccc ctg ttg ctg etc ctg ctg ccg ccg ctg ctg ctg ggg 107 
Thr Pro Ser Pro Leu Leu Leu Leu Leu Leu Pro Pro Leu Leu Leu Gly 
5 10 15 

gec ttc ccg ccg gec gec gec gec cga ggc ccc cca aag atg gcg gac 155 
Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala Asp 
20 25 30 

aag gtg gtc cca egg cag gtg gec egg ctg ggc cgc act gtg egg ctg 203 
Lys Val Val Pro Arg Gin Val Ala Arg Leu Gly Arg Thr Val Arg Leu 
35 40 45 

cag tgc cca gtg gag ggg gac ccg ccg ccg ctg acc atg tgg acc aag 251 
Gin Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr Lys 
50 55 60 65 

gat ggc cgc acc ate cac age ggc tgg age cgc ttc cgc gtg ctg ccg 299 
Asp Gly Arg Thr lie His Ser Gly Trp Ser Arg Phe Arg Val Leu Pro 
70 75 80 

cag ggg ctg aag gtg aag cag gtg gag egg gag gat gee ggc gtg tac 347 
Gin Gly Leu Lys Val Lys Gin Val Glu Arg Glu Asp Ala Gly Val Tyr 
85 90 95 

gtg tgc aag gec acc aac ggc ttc ggc age ctg age gtc aac tac acc 3 95 

Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr Thr 
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100 



105 



110 



etc gtc gtg ctg gat gac att age cca ggg aag gag age ctg ggg ccc 443 

Leu Val Val Leu Asp Asp He Ser Pro Gly Lys Glu Ser Leu Gly Pro 
115 120 125 

gac age tec tct ggg ggt caa gag gac ccc gec age cag cag tgg gca 491 

Asp ser Ser Ser Gly Gly Gin Glu Asp Pro Ala Ser Gin Gin Trp Ala 
130 135 140 " 145 

cga ccg cgc ttc aca cag ccc tec aag atg agg cgc egg* gtg ate gca 539 

Arg Pro Arg Phe Thr Gin Pro Ser Lys Met Arg Arg Arg Val lie Ala 
150 155 160 

egg ccc gtg ggt age tec gtg egg etc aag tgc gtg gee age ggg cac 5 87 

Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly His 

165 170 175 

cct egg ccc gac ate acg tgg atg aag gac gac cag gee ttg acg cgc 635 

Pro Arg Pro Asp He Thr Trp Met Lys Asp Asp Gin Ala Leu Thr Arg 
180 185 190 

cca gag gec get gag ccc agg aag aag aag tgg aca ctg age ctg aag 683 

Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu Lys 
195 200 "* ~ 205 

aac ctg egg ccg gag gac age ggc aaa tac ace tgc cgc gtg teg aac 731 

Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser Asn 
210 215 220 225 

cgc gcg ggc gec ate aac gec acc tac aag gtg gat gtg ate cag egg 779 

Arg Ala Gly Ala lie Asn Ala Thr Tyr Lys Val Asp Val He Gin Arg 
230 235 240 

acc cgt tec aag ccc gtg etc aca ggc acg cac ccc gtg aac acg acg .827 

Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr Thr 

245 250 255 

gtg gac ttc ggg ggg acc acg tec ttc cag tgc aag gtg cgc age gac 875 

Val Asp Phe Gly Gly Thr Thr Ser Phe Gin Cys Lys Val Arg Ser Asp 
260 265 270 

gtg aag ccg gtg ate cag tgg ctg aag cgc gtg gag tac ggc gee gag 923 

Val Lys Pro Val He Gin Trp Leu Lys Arg Val Glu Tyr Gly Ala Glu 
275 280 285 

ggc cgc cac aac tec acc ate gat gtg ggc ggc cag aag ttt gtg gtg 971 

Gly Arg His Asn Ser Thr He Asp Val Gly Gly Gin Lys Phe Val Val 
290 295 300 305 

ctg ccc acg ggt gac gtg tgg teg egg ccc gac ggc tec tac etc aat 1019 

Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu Asn 
310 315 320 

aag ctg etc ate acc cgt gec cgc cag gac gat gcg ggc atg tac ate 1067 

Lys Leu Leu He Thr Arg Ala Arg Gin Asp Asp Ala Gly Met Tyr He 

325 330 335 
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tgc ctt ggc gcc aac acc atg ggc tac age ttc cgc age gcc ttc etc 1115 
Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe Leu 
340 345 350 

acc gtg ctg cca gaC cca aaa ccg cca ggg cca cct gtg gcc tec teg 1163 
Thr Val Leu Pro Asp Pro Lys Pro Pro Gly . Pro Pro Val Ala Ser Ser 
355 360 365 

tec teg gcc act age ctg ccg tgg ccc gtg gtc ate ggc ate cca gcc 1211 
Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val lie Gly He Pro Ala 
370 375 380 385 

ggc get gtc ttc ate ctg ggc acc ctg etc ctg tgg ctt tgc cag gcc 1259 
Gly Ala Val Phe He Leu Gly Thr Leu Leu Leu Trp Leu Cys Gin Ala 
390 395 " 400 

cag aag aag ccg tgc acc ccc gcg cct gcc cct ccc ctg cct ggg cac 13 07 
Gin Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly His 
405 410 415 

cgc ccg ccg ggg acg gcc cgc gac cgc age gga gac aag gac ctt ccc 1355 
Arg Pro Pro Gly Thr Ala Arg Asp Arg Ser Gly Asp Lys Asp Leu Pro 
420 425 430 

teg ttg gcc gcc etc age get ggc cct ggt gtg ggg ctg tgt gag gag 1403 
Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu Glu 
435 440 445 

cat ggg tct ccg gca gcc ccc cag cac tta ctg ggc cca ggc cca gtt 1451 
His Gly Ser Pro Ala Ala Pro Gin His Leu Leu Gly Pro Gly Pro Val 
450 455 460 465 

get ggc cct aag ttg tac ccc aaa etc tac aca gac ate cac aca cac 1499 
Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr Thr Asp lie His Thr His 
470 475 .480 

aca cac aca cac fcet cac aca cac tea cac gtg gag ggc aag gtc cac 1547 
Thr His Thr His Ser His Thr His Ser His Val Glu Gly Lys Val His 
485 490 495 

cag cac ate cac tat cag tgc tagaeggcac egtatctgea gtgggcacgg 15 98 
Gin His lie His Tyr Gin Cys 
500 

gggggcegge cagacaggca gactgggagg atggaggacg gagctgeaga cgaaggcagg 1658 

ggacccatgg cgaggaggaa tggccagcac cccaggcagt ctgtgtgtga ggcatagccc 1718 

ctggacacac acacacagac acacacactg ectggatgea tgtatgeaca cacatgcgcg 1778 

cacacgtgct ccctgaaggc acacgtacgc acacacgcac atgeacagat atgccgcctg 183 8 

ggcacacaga taagctgccc aaatgeaege acacgcacag agacatgeca gaacatacaa 18 98 

ggacatgctg cctgaacata cacacgcaca cccatgcgca gatgtgctgc ctggacacac 1958 

acacacacac ggatatgctg tetggacgea cacacgtgca gatatggtat ccggacacac 2 018 

acgtgcacag atatgetgee tggacacaca gataatgetg ccttgacaca cacatgcacg 2 078 

gatattgect ggaeacacac acacacacgt gtgeacagat atgctgtctg gacacgcaca 2138 

cacatgeaga tatgetgect ggaeacacac ttccagacac acgtgcacag gegcagatat 2198 

gctgcctgga cacacgcaga tatgetgtet agtcacacac acacgcagac atgctgtccg 2258 

gacacacaca cgcatgcaca gatatgetgt ccggacacac acacgcacgc agatatgetg 2318 

cctggacaca cacacagata atgctgcctc aacactcaca caegtgeaga tattgectgg 2378 

acacacacat gtgeacagat atgctgtctg gaeatgeaca caegtgeaga tatgetgtec 2438 
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ggatacacac gcacgcacac atgcagatat gctgcctggg cacacacttc 
tgcacacaca ggtgcagata tgctgcctgg acacacgcag actgacgtgc 
gtgtgccgtg aagcctgcag tacgtgtgcc gtgaggctca tagttgatga 
ctgctccacc gtcactcccc caactctgcc cgcctctgtc cccgcctcag 
catccccgcc tctgtcccct ggccttggcg gctatttttg ccacctgcct 
ggagtcccct actgctgtgg gctggggttg ggggcacagc agccccaagc 
ggagcccatg gctagtggct catccccact gcattctccc cctgacacag 
ttggtattta tatttaagaa atgaagataa tattaataat gatggaagga 
gcagggactg tggtctctcc tggggcccgg gacccgcctg gtctttcagc 
accacacccc gtccaggcca gacaccaccc cccaccccac tgtcgtggtg 
tctgtaattt tatgtagagt ttgagctgaa gccccgtata tttaatttat 
atgaaagtgc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
agggcggccg c 

<210> 5 

<2ll> 504 

<212> PRT 

<213> Homo sapiens 



<400> 5 



Met 


Thr 


Pro Ser Pro Leu Leu Leu 


Leu Leu 


Leu Pro 


Pro 


Leu 


Leu 


Leu 


1 




5 


10 








15 




Gly 


Ala 


Phe Pro Pro Ala Ala Ala Ala Arg 


Gly Pro 


Pro 


Lys 


Met 


Ala 






20 


25 






30 






Asp 


Lys 


Val Val Pro Arg Gin Val 


Ala Arg 


Leu Gly 


Arg 


Thr 


Val 


Arg 






35 40 






45 








Leu 


Gin 


Cys Pro Val Glu Gly Asp 


Pro Pro 


Pro Leu 


Thr 


Met 


Trp 


Thr 




50 


55 




60 










Lye 


Asp 


Gly Arg Thr He His Ser Gly Trp 


Ser Arg 


Phe 


Arg 


Val 


Leu 


65 




70 




75 








80 


Pro 


Gin 


Gly Leu Lys Val Lys Gin 


Val Glu 


Arg Glu 


Asp 


Ala 


Gly 


Val 






85 


90 








95 




Tyr 


Val 


Cys Lys Ala Thr Asn Gly 


Phe Gly 


ser Leu 


Ser 


Val 


Asn 


Tyr 






100 


105 






110 






Thr 


Leu 


Val Val Leu Asp Asp He 


Ser Pro 


Gly Lys 


Glu 


Ser 


Leu 


Gly 






115 120 






125 








Pro 


Asp 


Ser Ser Ser Gly Gly Gin Glu Asp 


Pro Ala 


Ser 


Gin 


Gin 


Trp 




130 


135 




140 










Ala 


Arg 


Pro Arg Phe Thr Gin Pro 


Ser Lys 


Met Arg 


Arg 


Arg 


val 


He 


145 




150 




155 








160 


Ala 


Arg 


Pro Val Gly ser Ser val 


Arg Leu 


Lys Cys 


Val 


Ala 


Ser 


Gly 






165 


170 








175 




His 


Pro 


Arg Pro Asp He Thr Trp 


Met Lys 


Asp Asp 


Gin 


Ala 


Leu 


Thr 






180 


IBS 






190 






Arg 


Pro 


Glu Ala Ala Glu Pro Arg 


Lys Lys 


Lys Trp 


Thr 


Leu 


Ser 


Leu 






195 200 






205 








Lya 


Asn 


Leu Arg Pro Glu Asp Ser Gly Lys 


Tyr Thr 


Cys 


Arg 


Val 


Ser 




210 


215 




220 










Asn 


Arg 


Ala Gly Ala He Asn Ala 


Thr Tyr 


Lys Val 


Asp 


Val 


He 


Gin 


225 




230 




235 








240 


Arg 


Thr 


Arg Ser Lys Pro Val Leu 


Thr Gly 


Thr His 


Pro 


Val 


Asn 


Thr 






245 


250 








255 




Thr 


Val 


Asp Phe Gly Gly Thr Thr 


Ser Phe 


Gin Cys 


Lys 


Val 


Arg 


Ser 






260 


265 






270 






Asp 


Val 


Lys Pro Val He Gin Trp 


Leu Lys 


Arg Val 


Glu 


Tyr 


Gly 


Ala 






275 280 






285 








Glu 


Gly 


Arg His Asn Ser Thr He Asp Val 


Gly Gly 


Gin 


Lys 


Phe 


Val 




290 


295 




300 











cggacacaca 2498 

ttttgggagg 2558 

gggactttcc 2618 

tccccgcctc 2678 

tgggtgccca 273 8 

ctgagaggct 2798 

agaaggggcc 285 8 

agactgggtt 2918 

catgctgatg 297 8 

gccccagatc 3038 

tttgttaaac 3 098 

aaaaaaaaaa- 3158 
3169 
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val 


Leu 


Pro 


Thr 


Gly 


Asp 


Val 


Trp 


Ser 


Arg 


Pro 


Asp 


Gly 


Ser 


Tyr 


Leu 


305 










■si 




















320 


Asil 


Lys 


Leu 


Leu 


lie 
325 


Thr 


Arg 


Ala 


Arg 


Gin 

"3 1 rt 
3 3 V 


Asp 


ASp 


Ala 


Gly 


Met 
335 


Tyr 


lie 




eu 


Gly 

1 A C\ 

J % yJ 


Ala 


Asn 






i a ez 


Tyr 


Ser 


Jtrue 


Arg 


Ser 
350 




JrNue 


Leu 


Thr 


Val 
3 55 


Leu 


Pro 


Asp 


Pro 


Lys. 
360 


Pro 


Pro 


Gly 


Pro 


Pro 
365 


Val 


Ala 


Ser 


Ser 


Ser 


Ser 


Ala 


Thr 


Ser 


Leu 
375 


Pro 


Trp 


Pro 


val 


Val 
380 


lie 


Gly 


lie 


Pro 


Ala 


Gly 


Ala 


Val 


Phe 


lie 


Leu 


Gly 


Thr 


Leu 


Leu 


Leu 


Trp 


Leu 


Cys 


Gin 


385 










390 










395 










400 


Ala 


Gin 


Lys 


Lys 


Pro 
405 


Cys 


Thr 


Pro 


Ala 


Pro 
410 


Ala 


Pro 


Pro 


Leu 


Pro 
415 


Gly 


His 


Arg 


Pro 


Pro 
420 


Gly 


Thr 


Ala 


Arg 


Asp 
425 


Arg 


Ser 


Gly 


Asp 


Lys 
430 


Asp 


Leu 


Pro 


Ser 


Leu 
435 


Ala 


Ala 


Leu 


Ser 


Ala 
440 


Gly 


Pro 


Gly 


Val 


Gly 
445 


Leu 


Cys 


Glu 


Glu 


His 
450 


Gly 


Ser 


Pro 


Ala 


Ala 
455 


Pro 


Gin 


Hie 


Leu 


Leu 
460 


Gly 


Pro 


Gly 


Pro 


val 


Ala 


Gly 


Pro 


Lys 


Leu 


Tyr 


Pro 


L Y S 


Leu 


Tyr 


Thr 


Asp 


He 


His 


Thr 


465 










470 










475 










480 


His 


Thr 


His 


Thr 


His 
485 


Ser 


His 


Thr 


His 


Ser 
490 


His 


Val 


Glu 


Gly 


Lys 
495 


Val 


His 


Gin 


His 


lie 
500 


His 


Tyr 


Gin 


Cys 



















<210> 6 

<211> 1512 

<212> DNA 

<213> Homo sapiens 



<400> 6 

atgaegcega gccccctgtt gctgctcctg ctgccgccgc tgctgctggg ggcctfccccg 60 

ccggcegccg ccgcccgagg ccccccaaag atggcggaca aggtggtccc aeggcaggtg 120 

gcccggctgg gccgcactgt gcggctgcag tgcccagtgg agggggaccc gccgccgctg 180 

accatgtgga ccaaggatgg ccgcaccatc cacagcggct ggagecgett ccgcgtgatg 240 

ccgcaggggc tgaaggtgaa gcaggtggag egggaggatg ccggcgtgta cgtgtgcaag 3 00 

gccaccaacg getteggcag cctgagcgtc aactacaccc tegtegtget ggatgacatt 3 60 

ageccaggga aggagagect ggggcccgac agctcctctg ggggtcaaga ggaccccgcc 42 0 

agecagcagt gggcacgacc gcgcttcaca cagccctcca agatgaggcg ccgggtgatc 4 80 

gcaeggcccg tgggtagctc cgtgcggctc aagtgcgtgg ccagcgggca ccctcggccc 540 

gaeatcaegt ggatgaagga cgaccaggcc ttgacgcgcc cagaggccgc tgageccagg 600 

aagaagaagt ggacactgag cctgaagaac ctgeggeegg aggacagegg caaatacacc 660 

tgccgcgtgt cgaaccgcgc gggcgccatc aacgccacct acaaggtgga tgtgatccag 720 

cggacccgtt ccaagcccgt gctcacaggc acgcaccccg tgaacacgac ggtggacttc 7 80 

ggggggacca cgtccttcca gtgcaaggtg egcagcgacg tgaagccggt gatccagtgg 840 

cfcgaagcgcg tggagtaegg cgccgagggc cgccacaact ccaccatcga tgtgggcggc 900 

cagaagtttg tggtgctgcc caegggtgae gtgtggtcgc ggcccgacgg ctcctacctc 960 

aataagctgc tcatcacccg tgcccgccag gaegatgegg gcatgtacat ctgccttggc 1020 

gccaacacca tgggctaoag cttccgcagc gccttcctca ccgtgctgcc agacccaaaa 10 80 

ccgccagggc cacctgtggc ctcctcgtcc tcggccacta gcctgccgtg gcecgtggtc 1140 

atcggcatcc cagccggcgc tgtcttcatc ctgggcaccc tgctcctgtg getttgecag 12 00 

geccagaaga agccgtgcac ccccgcgcct gcccctcccc tgcctgggca ccgcccgccg 12 60 

gggacggccc gcgacegcag eggagacaag gaccttccct cgttggccgc cctcagcgct 1320 

ggccctggtg tggggctgtg tgaggagcat gggtctcegg cagcccccca gcacttactg 13 8 0 

ggcccaggcc. cagttgctgg ccctaagttg taccccaaac tctacacaga catceacaca 1440 

cacacacaca cacactctca cacacactca cacgtggagg gcaaggtcca ccagcacatc 1500 
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cactatcagt gc 



1512 



<210> 7 

<211> 1074 

<212> DNA 

<213> Mus mus cuius 

<220> 

<221> CDS 

<222> (3) . . . (626) 

<221> modif ied_base 
<222> all "n" positions 
<22 3> n=a, c, g, or t 

<400> 7 

ca cgc gtc egg ccc acg ggt gat gtg tgg tea egg cct gat ggc tec 47 

Arg Val Arg Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser 
1 5 10 15 

tac etc aac aag ctg etc ate tct egg gee cgc cag gat gat get ggc 95 
Tyr Leu Asn Lys Leu Leu lie Ser Arg Ala Arg Gin Asp Asp Ala Gly 
20 25 30 

atg tac ate tgc eta ggt gca aat acc atg ggc tac agt ttc cgt age 14 3 

Met Tyr lie Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser 
35 40 45 

gee ttc etc act gta tta cca gac ccc aaa cct cca ggg cct cct atg 191 
Ala Phe Leu Thr Val Leu Pro Asp Pro Lys Pro Pro Gly Pro Pro Met 
50 55 60 

get tct tea teg tea tec aca age ctg cca tgg cct gtg gtg ate ggc 23 9 

Ala Ser Ser Ser Ser Ser Thr Ser Leu Pro Trp Pro Val Val lie Gly 
65 70 75 

ate cca get ggt get gtc ttc ate eta ggc act gtg ctg etc tgg ctt 287 
He Pro Ala Gly Ala Val Phe He Leu Gly Thr Val Leu Leu Trp Leu 
80 85 90 95 

tgc cag acc aag aag aag cca tgt gec cca gca tct aca ctt cct gtg 335 
Cys Gin Thr Lys Lys Lys Pro Cys Ala Pro Ala Ser Thr Leu Pro Val 
100 105 110 

cct ggg cat cgt ccc cca ggg aca tec cga gaa cgc agt ggt gac aag 3 83 

Pro Gly His Arg Pro Pro Gly Thr Ser Arg Glu Arg Ser Gly Asp Lys 
115 120 125 

gac ctg ccc tea ttg get gtg ggc ata tgt gag gag cat gga tec gec 431 
Asp Leu Pro Ser Leu Ala Val Gly He Cys Glu Glu His Gly Ser Ala 
130 135 * 140 

atg gee ccc cag cac ate ctg gec tct ggc tea act get ggc ccc aag 479 
Met Ala Pro Gin His He Leu Ala Ser Gly Ser Thr Ala Gly Pro Lys 
145 150 155 

ctg tac ccc aag eta tac aca gat gtg cac aca cac aca cat aca cac 527 
Leu Tyr Pro Lys Leu Tyr Thr Asp Val His Thr His Thr His Thr His 
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160 



165 



170 



175 



ace tgc act cac acg etc tea tgt tgg agg gea agg ttc ate aac acc 5 75 

Thr Cys Thr His Thr Leu Ser Cys Trp Arg Ala Arg Phe lie Asn Thr 

180 185 190 

age atg tec act ate agt get aaa tac age gaa tct cca age act gtg 623 

Ser Met Ser Thr lie Ser Ala Lys Tyr Ser Glu Ser Pro ser Thr Val 

195 200 205 

tec tgaggtaggc atttgggggc caaggcaaca ggttgggaga attgagaaca 676 
Ser 



atggaggaag agtatcttag ggtgccttat ggtggacact cacaaacttg gecatataga 736 

tgtatgtact accagatgaa cagccagcca gattcaeaca cgcacatgtt taaacgtgta 796 

aacgtgtgca caactgcaca cacaacctga gaaaccttca ggaggatttg tggtgtgact 856 

ttgcagtgac atgtagcgat ggctagttga aggaatctcc ctcatgtctt agtggtcatg 916 

gccacttcce cacccctgcc catctgtgtt cctgcctggc cttggtggtg cttccgtgtg 976 

ccctgggttt tccaggaacc ctatcaacct gactggggtg agcagtgcag ecatgentgg 1036 

aggtttgagc caccctcccc ttgetagaga gaagggen 1074 

<210> 8 
<211> 208 
<212> PRT 

<213> Mas mus cuius 



<400> 8 



Arg 


Val 


Arg 


Pro 


Thr 


Gly 


Asp 


Val 


Trp 


Ser Arg Pro Asp Gly Ser Tyr 


1 








5 










10 ~ 15 


Leu 


Asn 


Lys 


Leu 


Leu 


He 


Ser 


Arg 


Ala 


Arg Gin Asp Asp Ala Gly Met 








20 










25 


30 


Tyr 


lie 


Cys 


Leu 


Gly 


Ala 


Asn 


Thr 


Met 


Gly Tyr Ser Phe Arg Ser Ala 






35 










40 




.45 


Phe 


Leu 


Thr 


Val 


Leu 


Pro 


Asp 


Pro 


Lys 


Pro Pro Gly Pro Pro Met Ala 




50 










55 






60 


Ser 


Ser 


Ser 


Ser 


Ser 


Thr 


Ser 


Leu 


Pro 


Trp Pro Val Val He Gly He 


65 










70 








75 80 


Pro 


Ala 


Gly 


Ala 


Val 


Phe 


Xle 


Leu 


Gly 


Thr Val Leu Leu Trp Leu Cys 










85 










90 95 


Gin 


Thr 


Lys 


Lys 


Lys 


Pro 


Cys 


Ala 


Pro 


Ala Ser Thr Leu Pro Val Pro 








100 










105 


110 


Gly 


His 


Arg 


Pro 


Pro 


Gly 


Thr 


Ser 


Arg 


Glu Arg Ser Gly Asp Lys Aep 






115 










120 




125 


Leu 


Pro 


Ser 


Leu 


Ala 


Val 


Gly 


He 


Cys 


Glu Glu His Gly Ser Ala Met 




130 










135 






140 


Ala 


Pro 


Gin 


His 


lie 


Leu 


Ala 


Ser 


Gly 


Ser Thr Ala Gly Pro Lys Leu 


145 










150 






155 160 


Tyr 


Pro 


Lys 


Leu 


Tyr 


Thr 


Asp 


Val 


His 


Thr His Thr His Thr His Thr 










165 










170 175 


Cys 


Thr 


His 


Thr 


Leu 


ser 


Cys 


Trp 


Arg 


Ala Arg Phe He Asn Thr Ser 








180 










185 


190 


Met 


Ser 


Thr 


He 


Ser 


Ala 


Lys 


Tyr 


Ser 


Glu Ser Pro Ser Thr Val Ser 






195 










200 




205 



<210> 9 
<211> 624 
<212> DNA 
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<:213> Mus mus cuius 



*400> 9 

cgcgtccggc ccacgggtga tgtgtggtca cggcctgatg gctcctacct caacaagctg 60 

ctcatctctc gggcccgcca ggatgatgct ggcatgtaca tctgcctagg tgcaaatacc 12 0 

atgggctaca gtttcagtag cgccttcctc actgtattac cagaccccaa acctccaggg 180 

cctcctatgg cttcttcatc gtcatccaca agcctgccat ggcctgtggt gatcggcatc 240 

ccagctggtg ctgtcttcat cctaggcact gtgctgctct ggctttgcca gaccaagaag 3 00 

aagccatgtg ccccagcatc tacacttcct gtgcctgggc atcgtccccc agggacatcc 360 

cgagaacgca gtggtgacaa ggacetgccc tcattggctg tgggcatatg tgaggagcat 42 0 

ggatccgcca tggcccccca gcacatcctg gcctctggct caactgctgg acccaagctg 4 80 

taccccaagc tatacacaga tgtgcacaca cacacacata cacacacctg cactcacacg 540 

ctctcatgtt ggagggcaag gttcatcaac accagcatgt ccactatcag tgctaaatac 600 

agcgaatctc caagcactgt gtcc 624 

<210> 10 

<211> 1423 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (31) . . . (444) 

<400> 10 

gtcgacccac gcgtccgccc acgcgtccgg atg act gga ccc aga gtg tgg ggg 54 

Met Pro Gly Pro Arg Val Trp Gly 
1 5 

aaa tat etc tgg aga age cct cac tec aaa ggc tgt cca ggc gca atg 102 
Lys Tyr Leu Trp Arg Ser Pro His Ser Lys Gly Cys Pro Gly Ala Met 
10 15 20 

tgg tgg ctg ctt etc tgg gga gtc etc cag get tgc cca ace egg ggc 15 0 

Trp Trp Leu Leu Leu Trp Gly Val Leu Gin Ala Cys Pro Thr Arg Gly 
25 30 35 40 

tec gtc etc ttg gec caa gag eta ccc cag cag ctg aca tec ccc ggg 198 
Ser Val Leu Leu Ala Gin Glu Leu Pro GXn Gin Leu Thr Ser Pro Gly 
45 50 55 

tac cca gag ccg tat ggc aaa ggc caa gag age age acg gac ate aag 246 
Tyr Pro Glu Pro Tyr Gly Lys Gly Gin Glu Ser Ser Thr Asp lie Lys 
60 65 70 

get cca gag ggc ttt get gtg agg etc gtc ttc cag gac tte gac ctg 2 94 

Ala Pro Glu Gly Phe Ala Val Arg Leu Val Phe Gin Asp Phe Asp Leu 
75 80 85 

gag ccg tec cag gac tgt gca ggg gac tct gtc aca gtg age tgg gga 342 
Glu Pro Ser Gin Asp Cys Ala Gly Asp Ser Val Thr Val Ser Trp Gly 
90 95 100 

tgg ggg ggg tec cgc cag gac tgt ggc cag gga gat tec egg ggt tgt 3 90 

Trp Gly Gly Ser Arg Gin Asp Cys Gly Gin Gly Asp Ser Arg Gly Cys 
105 110 115 120 

ggg aag tgg egg tgc cct gaa tec ccc ate tgg agg agg gat gaa ttt 43 8 
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Gly Lys Trp Arg Cys Pro Glu Ser Pro lie Trp Arg Arg Asp Glu Phe 
125 130 135 



tec atg taggggcagt egggcttgge ttaccgggga gcagtggtgg accccaggac 
Ser Met 



494 



acagcctccc 
caagcaggcc 
agecagggge 
tgcctgctct 
tcacacttac 
aggacatttt 
tcaggagact 
caacatagca 
attatctttt 
tttaatttta 
taattaaagt 
aggctgaggc 
aagaccccat 
gtcccagcta 
cagtgagctc 
caaaaaaaaa 



accagcgcct 
ctgcgtttgg 
agggaegcac 
atcaggtgag 
cttggaagag 
aaaaacagta 
gaggctggag 
agaccccatc 
gatttaaatt 
aaatttttaa 
tttctaaggc 
gaaagaagca 
ctctacaaaa 
oaagggaege 
tgatcatgac 
aaaaaaaaag 



ccggggctgc 
aaggcttatg 
atattggttg 
gaagctggac 
ctattacaaa 
cttgatggag 
gatcagaggg 
tcaaaaataa 
ttatttatat 
ttattatgga 
tgggcgcagt 
cttgagccca 
aaatttaaaa 
tgaagtgaga 
accgtactcc 
ggcggccgc 



catotgggee 
aatggacaca 
ttaaaaatat 
acaaataata 
acttctaacg 
tgatgeaage 
ctggagccca 
gtaaataata 
caaaatgaca 
tacataatag 
agctcatgtc 
ggaatttgag 
attagecaag 
ggatcacttg 
agcctgggtg 



ccacagagca 
caaatcttgc 
gtcatcatgt 
acaaaagatt 
ccaaagcctt 
ttgcagtccc 
gggttcaagg 
aataaaaata 
taaatttttg 
ttgtaagact 
tgtagtccca 
accagcctgg 
tgtggtggca 
agcctggaag 
acagagtgag 



aagagggcag 
aaatctatgg 
atttgttgag 
aagtcaccgt 
attcagaata 
agcagtatag 
ccagcctaag 
aaaagagcac 
aactttattt 
ttttgttttt 
gcactttggg 
gcaacatagc 
cgcacctgtg 
gtagaggctg 
acectgtctc 



554 
614 
674 
734 
794 
854 
914 
974 
1034 
1094 
1154 
1214 
1274 
1334 
13 94 
1423 





<210> 


11 




























<211> 


138 




























<212> 


PRT 




























<213> 


Homo sapiens 






















<400> 


11 


























Met 


Pro 


Gly 


Pro 


Arg 


Val 


Trp 


Gly 


Lys 


Tyr 


Leu 


Trp 


Arg 


Ser 


Pro 


His 


1 








5 










10 










15 




Ser 


Lys 


Gly 


Cys 


Pro 


Gly 


Ala 


Met 


Trp 


Trp 


Leu 


Leu 


Leu 


Trp 


Gly 


Val 








20 










25 










30 






Leu 


Gin 


Ala 


Cys 


Pro 


Thr 


Arg 


Gly 


Ser 


Val 


Leu 


Leu 


Ala 


Gin 


Glu 


Leu 






35 










40 










45 








Pro 


Gin 


Gin 


Leu 


Thr 


Ser 


Pro 


Gly 


Tyr 


Pro 


Glu 


Pro 


Tyr 


Gly 


Lys 


Gly 




50 










55 










50 










Gin 


Glu 


Ser 


Ser 


Thr 


Asp 


He 


Lys 


Ala 


Pro 


Glu 


Gly 


Phe 


Ala 


Val 


Arg 


65 










70 










75 








80 


Leu 


Val 


Phe 


Gin 


Asp 


Phe 


Asp 


Leu 


Glu 


Pro 


Ser 


Gin 


Asp 


Cys 


Ala 


Gly 










85 










90 










95 




Asp 


Ser 


Val 


Thr 


Val 


Ser 


Trp 


Gly 


Trp 


Gly 


Gly 


Ser 


Arg 


Gin 


Asp 


cys 








100 










105 










110 






Gly 


Gin 


Gly 


Asp 


Ser 


Arg 


Gly 


Cys 


Gly 


Lys 


Trp 


Arg 


Cys 


Pro 


Glu 


ser 






115 










120 










125 








Pro 


lie 


Trp 


Arg 


Arg 


Asp 


Glu 


Phe 


Ser 


Met 
















130 










135 





















<210> 12 

<:211> 414 

<212> DNA 

<213> Homo sapiens 

<400> 12 

atgcctggac ccagagtgtg ggggaaatat ctctggagaa gccctcactc caaaggctgt 
ccaggcgcaa tgtggtggct gcttctctgg ggagtcctcc aggcttgccc aacccggggc 
tccgtcctct tggeccaaga gctaccccag cagctgacat cccccgggta cccagagccg 



60 
120 
180 
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tatggcaaag gccaagagag cagcacggac atcaaggctc cagagggctt tgctgtgagg 240 

ctcgtcttcc aggacttcga cctggagccg tcccaggact gtgcagggga ctctgtcaca 300 

gtgagctggg gatggggggg gtcccgccag gactgtggcc agggagattc ccggggttgt 360 

gggaagtggc ggtgccctga atcccccatc tggaggaggg atgaattttc catg 414 

<210> 13 

<211> 5036 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (230) . (3379) 



<400> 13 

gtcgacccac gcgtccgctc gaagcgggga ccctcgcccc gtcctcggct gtccagtcct 60 
cctcctcgca gaccccggcg gttcctaccc caggccgcag gggagacggt gccccaaggc 120 
aggcttcata tcctgaacgc tgggatcccc caggacattc cctggccccc aggccccagg 18 o 
tcccaggccc cagggctgag ctgtgggcag gccccacctg gcctctgca atg tea ccg 23 8 

Met Ser Pro 
1 



cct ctg tgt ccc etc ctt etc ctg get gtg ggc ctg egg ctg get gga 
Pro Leu Cys Pro Leu Leu Leu Leu Ala Val Gly Leu Arg Leu Ala Gly 
5 10 15 



286 



act etc aac ccc agt gat ccc aat acc tgc age ttc tgg gaa age ttc 
Thr Leu Asn Pro Ser Asp Pro Asn Thr Cys Ser Phe Trp Glu Ser Phe 
20 25 30 35 



334 



act acc acc acc aag gag tec cac tec cgc ccc ttc age ctg etc ccc 
Thr Thr Thr Thr Lys Glu Ser His Ser Arg Pro Phe Ser Leu Leu Pro 
40 45 50 



382 



tea gag ccc tgc gag egg ccc tgg gag ggc ccc cat act tgc ccc age 
Ser Glu Pro Cys Glu Arg Pro Trp Glu Gly Pro His Thr Cys Pro Ser 
55 60 65 



430 



cca caa act cag agg aaa etc ctg get tct agg gat tea ttc tgc atg 
Pro Gin Thr Gin Arg Lys Leu Leu Ala Ser Arg Asp Ser Phe Cys Met 
70 75 80 



478 



gtc tgt gtc ggg get gga gtg cag tgg cga gat cgt agt gca ctg caa 
Val Cys Val Gly Ala Gly Val Gin Trp Arg Asp Arg Ser Ala Leu Gin 
85 90 95 



526 



cct caa aca ggg aat gcg ctt tct atg cgc cct cag ccc aga gtg ttg 
Pro Gin Thr Gly Asn Ala Leu Ser Met Arg Pro Gin Pro Arg Val Leu 
100 105 110 115 



574 



agt ggt gee cct tec ctg gee tec cct ggc cac act gtg gtg gtg aag 
Ser Gly Ala Pro Ser Leu Ala Ser Pro Gly His Thr Val Val Val Lys 
120 125 130 



622 



acg gac cac cgc cag cgc ctg cag tgc tgc cat ggc ttc tat gag age 
Thr Asp His Arg Gin Arg Leu Gin Cys Cys His Gly Phe Tyr Glu Ser 
135 140 145 



670 
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agcf ggg ttc tgt gtc ccg etc tgt gec cag gag tgt gtc cat ggc cgt 71B 

Arg Gly Phe Cys Val Pro Leu Cys Ala Gin Glu Cys Val His Gly Arg 

150 155 160 

tgt gtg gca ccc aat cag tgc caa tgt gtg cca ggc tgg egg ggc gac 766 

Cys Val Ala Pro Asn Gin Cys Gin Cys Val Pro Gly Trp Arg Gly Asp 

165 170 175 

gac tgt tec agt gec ccg aac tgc ctt cag ccc tgt acc cct ggc tac 814 

Asp Cys Ser ser Ala Pro Asn Cys Leu Gin Pro Cys Thr Pro Gly Tyr 

180 185 190 195 

tat ggc cct gec tgc cag ttc cgc tgc cag tgc cat ggg gca ccc tgc 862 

Tyr Gly Pro Ala Cys Gin Phe Arg Cys Gin Cys His Gly Ala Pro Cys 

200 ' 205 210 

gat ccc cag act gga gec tgc ttc tgc ccc gca gag aga act ggg ccc 910 

Asp Pro Gin Thr Gly Ala Cys Phe Cys Pro Ala Glu Arg Thr Gly Pro 

215 220 225 

age tgt gac gtg tec tgt tec cag ggc act tct ggc ttc ttc tgc ccc 958 

Ser Cys Asp Val Ser Cys Ser Gin Gly Thr Ser Gly Phe Phe Cys Pro 

230 235 240 

age acc cat cct tgc caa aat gga ggt gte ttc caa acc cca cag ggc 1006 

Ser Thr His Pro Cys Gin Asn Gly Gly Val Phe Gin Thr Pro Gin Gly 

245 250 255 

toe tgc age tgc ccc cct ggc tgg atg ggc acc ate tgc tec ctg ccc 1054 

Ser Cys Ser Cys Pro Pro Gly Trp Met Gly Thr lie CyB Ser Leu Pro 

260 265 270 275 

tgc cca gag ggc ttt cac gga ccc aac tgc tec cag gaa tgt cgc tgc 1102 

Cys Pro Glu Gly Phe His Gly Pro Asn Cys Ser Gin Glu Cys Arg Cys 

280 285 290 

cac aac ggc ggc etc tgt gac cga ttc act ggg cag tgc cgc tgc get 1150 

His Asn Gly Gly Leu Cys Asp Arg Phe Thr Gly Gin Cys Arg Cys Ala 

295 300 305 

ccg ggt tac act ggg gat egg tgc egg gag gag tgc ccg gtg ggc cgc 1198 

Pro Gly Tyr Thr Gly Asp Arg Cys Arg Glu Glu Cys Pro Val Gly Arg 

310 315 320 

ttt ggg cag gac tgt get gag acg tgc gac tgc gec ccg gac gee cgt 1246 

Phe Gly Gin Asp Cys Ala Glu Thr Cys Asp Cys Ala Pro Asp Ala Arg 

325 " 330 335 

tgc ttc ccg gee aac ggc gca tgt ctg tgc gaa cac ggc ttc act ggg 1294 

Cys Phe Pro Ala Asn Gly Ala Cys Leu Cys Glu His Gly Phe Thr Gly 

340 345 350 355 

gac cgc tgc acg gat cgc etc tgc ccc gac ggc ttc tac ggt etc age 1342 

Asp Arg Cys Thr Asp Arg Leu Cys Pro Asp Gly Phe Tyr Gly Leu Ser 

360 365 " 370 

tgc cag gec ccc tgc acc tgc gac egg gag cac age etc age tgc cac 1390 

Cys Gin Ala Pro Cys Thr Cys Asp Arg Glu His Ser Leu Ser Cys His 
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375 380 385 

ccg atg aac ggg gag tgc tec tgc ctg ccg ggc tgg gcg ggc etc cac 14 3 s 

Pro Met Asn Gly Glu Cys Ser Cys Leu Pro Gly Trp Ala Gly Leu His 

390 395 * 400 

tgc aac gag age tgc ccg cag gac acg cat ggg cca ggg tgc cag gag 14 8 6 

Cys Asn Glu Ser Cys Pro Gin Asp Thr His Gly Pro Gly Cys Gin Glu 
405 410 415 

cac tgt etc tgc ctg cac ggt ggc gte tgc cag get acc age ggc etc 1534 

His Cys Leu Cys Leu His Gly Gly Val Cys Gin Ala Thr Ser Gly Leu 
420 425 430 435 

tgt cag tgc gcg ccg ggt tac acg ggc cct cac tgt get agt ctt tgt 15 82 

Cys Gin Cys Ala Pro Gly Tyr Thr Gly Pro His Cys Ala Ser Leu Cys 
440 445 450 

cct cct gac acc tac ggt gte aac tgt tct gca cgc tgc tea tgt gaa 163 0 

Pro Pro Asp Thr Tyr Gly Val Asn Cys Ser Ala Arg Cys Ser Cys Glu 

455 460 465 

aat gee ate gee tgc tea ccc ate gac ggc gag tgc gte tgc aag gaa 167 8 

Asn Ala lie Ala Cys Ser Pro lie Asp Gly Glu Cys Val Cys Lys Glu 

470 475 480 

9gt .tgg cag cgt ggt aac tgc tct gtg ccc tgc cea ccc gga ace tgg 1726 

Gly Trp Gin Arg Gly Asn Cys Ser Val Pro Cys Pro Pro Gly Thr Trp 
485 490 495 

ggc ttc agt tgc aat gee age tgc cag tgt gee cat gag gca gte tgc 1774 

Gly Phe Ser Cys Asn Ala Ser Cys Gin Cys Ala His Glu Ala Val Cys 
500 505 510 515 

age ccc caa act gga gec tgt acc tgc acc cct ggg tgg cat ggg gec 1822 

Ser Pro Gin Thr Gly Ala Cys Thr Cys Thr Pro Gly Trp His Gly Ala 

520 " 525 ' ' 530 

cac tgc cag ctg ccc tgt ccg aag ggg cag ttt gga gaa ggt tgt gec 187 0 

His Cys Gin Leu Pro Cys Pro Lys Gly Gin Phe Gly Glu Gly Cys Ala 

535 540 545 

agt cgc tgt gac tgt gac cac tct gat ggc tgt gac cct gtt cat gga 1918 

Ser Arg Cys Asp Cys Asp His Ser Asp Gly Cys Asp Pro Val His Gly 

550 555 ' 560 

cgc tgt cag tgc cag get ggc tgg atg ggt gec cgc tgc cac ctg tec 1966 

Arg Cys Gin Cys Gin Ala Gly Trp Met Gly Ala Arg Cys His Leu Ser 
565 570 575 

tgc cct gag ggc tta tgg gga gte aac tgt age aac acc tgc acc tgc 2014 

Cys Pro Glu Gly Leu Trp Gly Val Asn Cys Ser Asn Thr Cys Thr Cys 
580 585 590 595 

aag aat ggg ggc acc tgt etc cct gag aat ggc aac tgc gtg tgt gca 2062 

Lys Asn Gly Gly Thr Cys Leu Pro Glu Asn Gly Asn Cys Val Cys Ala 
600 605 610 
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ccc gga ttc egg ggc ccc tec tgc cag aga tec tgt cag cct ggc cge 2110 
Pro Gly Phe Arg Gly Pro Ser Cys Gin Arg Ser Cys Gin Pro Gly Arg 
615 620 ~ 625 

tat ggc aaa cgc tgt gtg ccc tgc aag tgc get aac cac tec ttc tgc 2158 
Tyr Gly Lys Arg Cys Val Pro Cys Lys Cys Ala Asn His Ser Phe Cys 
630 635 640 

cac ccc teg aac ggg ace tgc tac tgc ctg get ggc tgg aca ggc ccc 22 06 

His Pro Ser Asn Gly Thr Cys Tyr Cys Leu Ala Gly Trp Thr Gly Pro 
645 650 655 

gac tgc tec cag cca tgc cct cca gga cac tgg gga gaa aac tgt gee 2254 
Asp Cys Ser Gin Pro Cys Pro Pro Gly His Trp Gly Glu Asn Cys Ala 
660 665 670 675 

cag acc tgc caa tgt cac cat ggt ggg acc tgc cat ccc cag gat ggg . 2302 
Gin Thr Cys Gin Cys His His Gly Gly Thr Cys His Pro Gin Asp Gly 
680 685 * • 690 

age tgt ate tgc ccc eta ggc tgg act gga cae cac tgc tta gaa ggc 2350 
Ser Cys lie Cys Pro Leu Gly Trp Thr Gly His His Cys Leu Glu Gly 
695 700 705 

tgc cct ctg ggg aca ttt ggt get aac tgc tec cag cca tgc cag tgt 23 98 
Cys Pro Leu Gly Thr Phe Gly Ala Asn Cys Ser Gin Pro Cys Gin Cys 
710 715 720 

ggt cct gga gaa aag tgc cac cca gag act ggg gee tgt gta tgt ccc 2446 
Gly Pro Gly Glu Lys Cys His Pro Glu Thr Gly Ala Cys Val Cys Pro 
725 730 735 

cca ggg cac agt ggt gca cct tgc agg att gga ate cag gag ccc ttt 2494 
Pro Gly His Ser Gly Ala Pro Cys Arg lie Gly lie Gin Glu Pro Phe 
740 745 750 755 

act gtg atg ccg acc act cca gta gcg tat aac teg ctg ggt gca gtg 2542 
Thr Val Met Pro Thr Thr Pro Val Ala Tyr Asn Ser Leu Gly Ala val 
760 765 770 

att ggc att gca gtg ctg ggg tec ctt gtg gta gec ctg gtg gca ctg 2590 
He Gly lie Ala Val Leu Gly Ser Leu Val Val Ala Leu Val Ala Leu 
775 780 785 

ttc att ggc tat egg cac tgg caa aaa ggc aag gag cac cac cac ctg 2638 
Phe He Gly Tyr Arg His Trp Gin Lys Gly Lys Glu His His His Leu 
790 795 800 

get gtg get tac age age ggg cgc ctg gac ggc tec gag tat gtc atg 2686 
Ala Val Ala Tyr Ser Ser Gly Arg Leu Asp Gly Ser Glu Tyr Val Met 
805 810 815 

cca gat gtc cct ccg age tac agt cac tac tac tec aac ccc age tac 2734 
Pro Asp Val Pro Pro Ser Tyr Ser His Tyr Tyr Ser Asn Pro Ser Tyr 
820 825 830 835 

cac acc ctg teg cag tgc tec cca aac ccc cca ccc cct aac aag gtt 2782 
His Thr Leu Ser Gin Cys Ser Pro Asn Pro Pro Pro Pro Asn Lys Val 
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840 845 850 

cca ggc ccg etc ttt gec age ctg cag aac cct gag egg cca ggt ggg 2 83 0 
Pro Gly Pro Leu Phe Ala Ser Leu Gin Asn Pro Qlu Arg Pro Gly Gly 
855 860 865 

gee caa ggg cat gat aac cac ace ace ctg cct get gac tgg aag cac 2 878 

Ala Gin Gly His Asp Asn His Thr Thr Leu Pro Ala Asp Trp Lys His 
870 875 880 

cgc egg gag ccc cct cca ggg cct ctg gac agg ggg age age cgc ctg 2 926 
Arg Arg Glu Pro Pro Pro Gly Pro Leu Asp Arg Gly Ser Ser Arg Leu 
885 890 835 

gac cga age tac age tat age tac age aat ggc cca ggc cca ttc tac 2974 
Asp Arg Ser Tyr Ser Tyr Ser Tyr Ser Asn Gly Pro Gly Pro Phe Tyr 
900 905 910 915 

gat aaa ggg etc ate tct gaa gag gag etc ggg gee agt gtg get tec 3 022 
Asp Lys Gly Leu He Ser Glu Glu Glu Leu Gly Ala Ser Val Ala Ser 
920 925 930 

ctg age agt gag aac cca tat gec ace ate egg gac ctg ccc age ttg 3070 
Leu Ser Ser Glu Asn Pro Tyr Ala Thr He Arg Asp Leu Pro Ser Leu 
935 940 945 

cca ggg ggc ccc egg gag age age tac atg gag atg aaa ggc cct ccc 3118 
Pro Gly Gly Pro Arg Glu Ser Ser Tyr Met Glu Met Lys Gly Pro Pro 
950 955 960 

tea gga tct gec ccc agg cag cct cct cag ttt tgg gac age cag agg 3166 
Ser Gly Ser Ala Pro Arg Gin Pro Pro Gin Phe Trp Asp Ser Gin Arg 
965 970 975 

egg egg caa ccc cag cca cag aga gac agt ggc ace tac gag cag ccc 3214 
Arg Arg Gin Pro Gin Pro Gin Arg Asp Ser Gly Thr Tyr Glu Gin Pro 
980 985 990 "* 995 

age ccc ctg ate cat gac cga gac tct gtg ggc tec cag ccc cct ctg 32 62 

Ser Pro Leu He His Asp Arg Asp Ser Val Gly Ser Gin Pro Pro Leu 
1000 1005 1010 

cct ccg ggc eta ccc ccc ggc cae tat gac tea ccc aag aac age cac 3310 
Pro Pro Gly Leu Pro Pro Gly His Tyr Asp Ser Pro Lys Asn Ser His 
1015 1020 " 1025 

ate cct gga cat tat gac ttg cct cca gta egg cat ccc cca tea cct 33 58 
He Pro Gly His Tyr Asp Leu pro Pro Val Arg His Pro Pro Ser Pro 
1030 1035 1040 

cca ctt cga cgc cag gac cgt tgaggageca ggatggtatg geagaggeca 34 09 

Pro Leu Arg Arg Gin Asp Arg 
1045 1050 

gcacacctgg ctgttgctgc tcaaggctgg ggacagagee tagtgtaccc ctgecaggag 34 69 

cagggagtgg accggcaggc tgtgaacatg aacaacgett aacagagcaa gtgatgggag 3529 

ccttgttcct gggttctacc atgggagacg ctgatcagca ggatgcctgg ctccctttcc 3589 

caacccactg ctcccaaggc ctccagggcc ctgtgtacat aaactggtgg gttggaagtt 3 649 
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gctgggtaac 
gggctctgtg 
agggcactta 
ctaactggcc 
tacaggctgg 
ccagtatcaa 
cagcctcacc 
tcctgtaggc 
cccactcgcc 
ctccgtacat 
ggcctacagg 
atctctgcct 
caatattagc 
gtggctgact 
gtatgctctg 
ccagggtaac 
ctgaggctga 
gccttgattt 
catatatgtg 
acactgcttt 
ctctacattg 
gtgttatagt 
catgaaccgc 
gcggccg 



tctgatttca 
cgtgtgtgtg 
ccatttagta 
tcctccattg 
ttagttactc 
aacttttgaa 
ttgaactgtg 
cactcttctt 
tgttcaccoc 
ctttcacagc 
gtgccaggca 
cccccactag 
tctcccttgg 
gaafctaagta 
acagttacag 
aacgcagagc 

ggggggtgca 

attgcacttc 
acagcaatag 
aagcaaatct 
cattttggta 
gatctgtgat 
acccatataa 



gacatgcgtg 
tttctgtgat 
gggagatgga 
attcagtgaa 
cctacctgaa 
ggccttaaag 
ttcctgtcac 
tctggcacag 
tgcttttcct 
cctgattgca 
cttctttaat 
actgtaagct 
cacagagtag 
ccagtgacat 
actgaataag 
tcaggtgtgg 
gaggctggag 
acaggtagca 
gttaagaaaa 
gttggcacca 
attgcttgca 
cagtgatctt 
cacggtaaac 



tggggtacct 

tttagaaggg 
accaacccaa 
ccttccaatg 
agccttcata 
gccctgcttt 
tgcacgccag 
ggacctgcac 
ttacacctcc 
gctgtgttca 
gggttc'ttte 
ccctgaaggc 
gcactcaaca 
gcagtaactg 
ttggagactt 
gaaggtgcca 
aaaggataac 
gaatttttaa 
gcaaagcaga 
tttttccaat 
atatttcaag 
tgatatatta 
ttaatcagca 



tttctgtgca 
taccaggcag 
ttaactctag 
catggctcat 
ggtgcctctt 
gcctggccca 
tcacaccggc 
acctggagtg 
tcctcaggga 
ctcaccaggt 
tttatgtgat 
aagaatccfcg 
aatgctcccc 
ctaagataga 
ccctaaaggg 
ggggcagggg 
aggagagagt 
agaaattgaa 
gaaattgaag 
agcatgtgcc 
cattttcatt 
ttgtaattgt 
aaaaaaaaaa 



tgctcagcct 
gfctctgtcct 
caatagcctc 
aatttcaaaa 
tgctcttctg 
tctgtctctc 
ctctaggtcc 
cccttcctcc 
agtgcccacc 
acctgcagaa 
tatttgatta 
tgcttatgct 
aaaaggctga 
fcgagccatct 
tggcatttcc 
tgcagagggg 
atacaggcat 
ggttttggga 
atttgtgtca 
cattttgggt 
gttattatat 
fctcggggcgc 
aaaaaaaagg 



3709 
3769 
3829 
3889 
3949 
4009 
40 69 
4129 
4189 
4249 
4309 
4369 
4429 
4489 
4549 
4609 
4669 
4729 
4789 
4849 
4909 
4969 
5029 
5036 



<210> 
<211> 
<212> 
<213> 
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Homo sapiens 



^400> 14 



Met 


Ser 


Pro 


Pro 


Leu 


Cys Pro Leu Leu Leu 


Leu 


Ala 


Val 


Gly 


Leu 


Arg 


1 








5 


10 










15 




Leu Ala Gly Thr 


Leu 


Asn Pro Ser Asp Pro 


Asn 


Thr 


Cys 


Ser 


Phe 


Trp 








20 




25 








30 






Glu 


Ser 


Phe 


Thr 


Thr 


Thr Thr Lys Glu Ser 


His 


Ser 


Arg 


Pro 


Phe 


Ser 






35 






40 






45 








Leu 


Leu 


Pro 


Ser 


Glu 


Pro Cys Glu Arg Pro 


Trp 


Glu 


Gly 


Pro 


His 


Thr 




50 








55 




60 










Cys 


Pro 


Ser 


Pro 


Gin 


Thr Gin Arg Lys Leu 


Leu 


Ala 


Ser 


Arg 


Asp 


Ser 


65 










70 


75 










80 


Phe 


Cys 


Met 


Val 


Cys 


Val Gly Ala Gly Val 


Gin 


Trp 


Arg 


Asp 


Arg 


Ser 










85 


90 










95 




Ala 


Leu 


Gin 


Pro 


Gin 


Thr Gly Asn Ala Leu 


Ser 


Met 


Arg 


Pro 


Gin 


Pro 








100 




105 








110 






Arg Val 


Leu 


Ser 


Gly Ala Pro Ser Leu Ala 


Ser 


Pro 


Gly 


His 


Thr 


Val 






115 






12 0 






125 








Val 


Val 


I»ys 


Thr 


Asp 


His Arg Gin Arg Leu 


Gin 


Cys 


Cys 


His 


Gly 


Phe 




130 








135 




140 










Tyr 


Glu 


Ser 


Arg 


Gly 


Phe Cys Val Pro Leu 


Cys 


Ala 


Gin 


Glu 


Cys 


Val 


145 










150 


155 










160 


Hie 


Gly Arg Cys 


Val 


Ala Pro Asn Gin Cys 


Gin 


Cys 


Val 


Pro 


Gly Trp 










165 


170 










175 




Arg Gly Asp Asp 


Cys 


Ser Ser Ala Pro Asn 


Cys 


Leu 


Gin 


Pro 


Cys 


Thr 








180 




185 








190 






Pro 


Gly Tyr Tyr 


Gly 


Pro Ala Cys Gin Phe 


Arg 


Cys 


Gin 


Cys 


His 


Gly 






195 






200 






205 








Ala 


Pro 


Cys 


Asp 


Pro Gin Thr Gly Ala Cys 


Phe 




Pro 


Ala 


Glu Arg 
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210 










215 




Thr 


Gly Pro 


Ser 


Cys 


Asp 


Val 


Ser 


225 










230 






Phe 


Cys 


Pro 


Ser 


Thr 
245 


His 


Pro 


Cys 


Pro 


Gin Gly Ser 


Cys 


Ser 


Cys 


Pro 








260 










Ser 


Leu 


Pro 


Cys 


Pro 


Glu Gly phe 






275 










280 


Cys 


Arg Cys 


His 


Asn Gly Gly Leu 




290 










295 




Arg 


Cys 


Ala 


Pro 


Gly Tyr Thr Gly 


305 










310 






Val 


Gly Arg Phe 


Gly Gin Asp 


Cys 










325 








Asp 


Ala Arg 


Cys 


Phe 


Pro 


Ala 


Asn 








340 










Phe 


Thr 


Gly Asp 


Arg 


Cys 


Thr 


Asp 






355 










360 


Gly 


Leu 
370 


Ser 


Cys 


Gin 


Ala 


Pro 
375 


Cys 


Ser 


Cys 


His 


Pro 


Met 


Asn 


Gly 


Glu 


385 










390 






Gly 


Leu 


His 


Cys 


Asn 
405 


Glu 


Ser 


Cys 


Cys 


Gin 


Glu 


His 
420 


Cys 


Leu 


Cys 


Leu 


Ser 


Gly Leu 


Cys 


Gin 


Cys 


Ala 


Pro 






435 










440 


Ser 


Leu 
450 


Cys 


Pro 


Pro 


Asp 


Thr 
455 


Tyr 


Ser 


Cys 


Glu 


Asn 


Ala 


lie 


Ala 


Cys 


465 










470 






Cys 


Lys 


Glu Gly 


Trp 


Gin Arg 


Gly 










485 








Gly 


Thr 


Trp 


Gly 
500 


Phe 


Ser 


Cys 


Asn 


Ala 


Val 


Cys 
515 


Ser 


Pro 


Gin 


Thr 


Gly 
52 0 


His 


Gly Ala 


His 


Cys 


Gin 


Leu 


Pro 




530 










535 




Gly 


Cys 


Ala 


Ser 


Arg 


Cys 


Asp 


Cys 


545 










550 






Val 


His 


Gly Arg 


Cys 


Gin 


Cys 


Gin 










565 








His 


Leu 


Ser 


Cys 


Pro 


Glu Gly Leu 








580 










Cys 


Thr 


Cys 


Lys 


Asn Gly Gly Thr 






595 










600 


Val 


Cys 
610 


Ala 


Pro 


Gly 


Phe 


Arg 
615 


Gly 


Pro 


Gly Arg Tyr 


Gly 


Lys 


Arg 


Cys 


625 










630 






Ser 


Phe 


Cys 


His 


Pro 


Ser Asn Gly 










645 








Thr 


Gly 


Pro 


Asp 
660 


Cys 


Ser 


Gin 


Pro 


Asn 


Cys 


Ala 


Gin 


Thr 


Cys 


Gin 


Cys 









220 










Cys 


Ser 


Gin Gly 


Thr 


Ser 


Gly 


Phe 






235 










240 


Gin 


Asn 


Gly Gly 


Val 


Phe 


Gin 


Thr 




250 










255 




Pro Gly 


Trp 


Met 


Gly 


Thr 


lie 


Cys 


265 










270 






His 


Gly 


Pro 


Asn 


Cys 


Ser 


Gin 


Glu 










285 








Cys 


Asp 


Arg 


Phe 


Thr 


Gly 


Gin 


Cys 








300 










Asp Arg 


Cys 


Arg 


Glu 


Glu 


Cys 


Pro 






315 










320 


Ala 


Glu 


Thr 


Cys 


Asp 


Cys 


Ala 


Pro 




330 










335 




Gly Ala 


Cys 


Leu 


Cys 


Glu 


His 


Gly 


345 










350 






Arg 


Leu 


Cys 


Pro 


Asp 


Gly 


Phe 


Tyr 










365 








Thr Cys 


Asp Arg 


Glu 


His 


Ser 


Leu 








380 










Cys 


Ser 


Cys 


Leu 


Pro 


Gly 


Trp 


Ala 






395 










400 


Pro 


Gin 


Asp Thr 


His 


Gly 


Pro 


Gly 




410 










415 




His 


Gly 


Gly Val 


Cys 


Gin 


Ala 


Thr 


425 










43 0 






Gly Tyr 


Thr Gly 


Pro 


His 


Cys 


Ala 










445 








Gly Val 


Asn 


Cys 


Ser 


Ala 


Arg 


Cys 








460 










Ser 


Pro 


lie 


Asp 


Gly 


Glu 


Cys 


Val 






475 










480 


Asn 


Cys 


Ser 


val 


Pro 


Cys 


Pro 


Pro 




490 










495 




Ala 


Ser 


Cys 


Gin 


Cys 


Ala 


His 


Glu 


505 










510 






Ala 


Cys 


Thr 


C Y S 


Thr 


Pro 


Gly 


Trp 










525 








Cys 


Pro 


Lys 


Gly 


Gin 


Phe 


Gly 


Glu 








540 










A sp 


His 


Ser Asp 


Gly 


Cys 


Asp 


Pro 






555 










560 


Ala Gly 


Trp Met 


Gly 


Ala 


Arg 


Cys 




570 










575 




Trp 


Gly 


Val 


Asn 


Cys 


Ser 


Asn 


Thr 


585 










590 






Cys 


Leu 


Pro 


Glu 


Asn 


Gly 


Asn 


Cys 










605 








Pro 


Ser 


Cys 


Gin 


Arg 


Ser 


Cys 


Gin 








620 










Val 


Pro 


Cys 


Lys 


Cys 


Ala 


Asn 


His 






635 










640 


Thr 


Cys 


Tyr 


Cys 


Leu 


Ala 


Gly 


Trp 




650 










655 




Cys 


Pro 


Pro 


Gly 


His 


Trp 


Gly 


Glu 


665 










670 






His 


His 


Gly Gly 


Thr 


Cys 


His 


Pro 
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675 680 685 

Gin Asp Gly Ser Cys lie Cys Pro Leu Gly Trp Thr Gly His His Cys 

690 695 700 

Leu Glu Gly Cys Pro Leu Gly Thr Phe Gly Ala Asn Cys Ser Gin Pro 
705 710 715 720 

Cys Gin Cys Gly Pro Gly Glu Lys Cys His Pro Glu Thr Gly Ala Cys 

725 730 735 

Val Cys Pro Pro Gly His Ser Gly Ala Pro Cys Arg lie Gly lie Gin 

740 745 750 

Glu Pro Phe Thr Val Met Pro Thr Thr Pro Val Ala Tyr Asn Ser Leu 

755 760 765 

Gly Ala Val lie Gly lie Ala Val Leu Gly Ser Leu Val Val Ala Leu 

770 775 780 

Val Ala Leu Phe lie Gly Tyr Arg His Trp Gin Lys Gly Lys Glu His 
785 790 * 795 800 

His His Leu Ala Val Ala Tyr Ser Ser Gly Arg Leu Asp Gly Ser Glu 

805 810 815 

Tyr Val Met Pro Asp Val Pro Pro Ser Tyr Ser His Tyr Tyr Ser Asn 

820 825 * 830 

Pro Ser Tyr His Thr Leu Ser Gin Cys Ser Pro Asn Pro Pro Pro Pro 

835 84 0 845 

Asn Lys Val Pro Gly Pro Leu Phe Ala Ser Leu Gin Asn Pro Glu Arg 

850 855 860 

Pro Gly Gly Ala Gin Gly His Asp Asn His Thr Thr Leu Pro Ala Asp 
865 " * 870 875 880 

Trp Lys His Arg Arg Glu Pro Pro Pro Gly Pro Leu Asp Arg Gly Ser 

885 890 895 

Ser Arg Leu Asp Arg Ser Tyr Ser Tyr Ser Tyr Ser Asn Gly Pro Gly 

900 905 910 

Pro Phe Tyr Asp Lys Gly Leu lie Ser Glu Glu Glu Leu Gly Ala Ser 

915 920 925 

Val Ala Ser Leu Ser Ser Glu Asn Pro Tyr Ala Thr lie Arg Asp Leu 

930 935 940 

Pro Ser Leu Pro Gly Gly Pro Arg Glu Ser Ser Tyr Met Glu Met Lys 
945 950 955 960 

Gly Pro Pro Ser Gly Ser Ala Pro Arg Gin Pro Pro Gin Phe Trp Asp 

965 970 975 

Ser Gin Arg Arg Arg Gin Pro Gin Pro Gin Arg Asp Ser Gly Thr Tyr 

980 985 990 

Glu Gin Pro Ser Pro Leu lie His Asp Arg Asp Ser Val Gly Ser Gin 

995 1000 1005 

Pro Pro Leu Pro Pro Gly Leu Pro Pro Gly His Tyr Asp Ser Pro Lys 

1010 1015 1020 

Asn Ser His lie Pro Gly His Tyr Asp Leu Pro Pro Val Arg His Pro 
1025 1030 1035 1040 

Pro Ser Pro Pro Leu Arg Arg Gin Asp Arg 
1045 1050 

<210> 15 
<211> 3150 
<212> DMA 
. <213> Homo sapiens 

<400> 15 

atgtcaccgc ctctgtgtcc cctccttctc ctggctgfcgg gcctgcggct ggctggaact 60 

ctcaacccca gtgatcccaa tacctgcagc ttctgggaaa gcttcactac caccaccaag 120 

gagtcccact cccgcccctt cagcctgctc ccctcagagc cctgcgagcg gccctgggag 180 

ggcccccata cttgccccag cecacaaact cagaggaaac tcctggcttc tagggattca 240 
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ttctgcatgg tctgtgtegg ggctggagtg cagtggcgag ategtagtgc actgcaacct 300 

caaacaggga atgcgctttc tatgcgccct cagcccagag tgttgagtgg tgccccttcc 360 

ctggcctccc ctggccacac tgtggtggtg aagacggacc accgccagcg cctgcagtgc 42 0 

tgccatggct tctatgagag cagggggttc tgtgtcccgc tctgtgccca ggagtgtgtc 480 

catggccgtt gtgtggcacc caatcagtgc caatgtgtgc caggctggcg gggcgacgac 540 

tgttccagtg ccccgaactg ccttcagccc tgtacccctg gctactatgg ccctgcctgc 60 0 

cagttccgct gccagtgcca tggggcaccc tgcgatcccc agactggagc ctgcttctgc 660 

cccgcagaga gaactgggcc cagctgtgac gtgtcctgtt cccagggcac ttctggcttc 720 

ttctgcccca gcacccatcc ttgccaaaat ggaggtgtct tccaaacccc acagggctcc 780 

tgcagctgcc cccctggctg gafcgggcacc atctgctccc tgccctgccc agagggcttt 84 0 

cacggaccca actgctccca ggaatgtcgc tgceacaacg gcggcctctg tgaccgattc 900 

actgggcagt gccgctgcgc tccgggttac actggggatc ggtgccggga ggagtgcccg 960 

gtgggccgct ttgggcagga ctgtgctgag acgtgcgact gcgccccgga cgcccgttgc 1020 

ttcccggcca acggcgcatg tctgtgcgaa cacggcttca ctggggaccg ctgcacggat 1080 

cgcctctgcc ccgacggctt ctacggtctc agctgccagg ccccctgcac ctgcgaccgg 1140 

gagcacagcc tcagctgcca cccgatgaac ggggagtgct cctgcctgcc gggctgggcg 1200 

ggcctccact gcaacgagag ctgcccgcag gacacgcatg ggccagggtg ccaggagcac 1260 

tgtctctgcc tgcacggtgg cgtctgccag gctaccagcg gcctctgtca gtgcgcgccg 1320 

ggttacacgg gccctcaatg tgctagtctt tgtcctcctg acacctacgg tgtcaactgt 1380 

tctgcacgct gctcatgtga aaatgccatc gcctgctcac ccatcgacgg cgagtgcgtc 144 0 

tgcaaggaag gttggcagcg tggtaactgc tctgtgccct gcccacccgg aacctggggc 1500 

ttcagttgca atgccagctg ccagtgtgcc catgaggcag tctgcagccc ccaaactgga 1560 

gcctgtacct gcacccctgg gtggcatggg gcccactgcc agctgccctg tccgaagggg 162 0 

cagtttggag aaggttgtgc cagtcgctgt gactgtgacc actctgatgg ctgtgaccct 1680 

gttcatggac gctgtcagtg ccaggctggc tggatgggtg cccgctgcca cctgtcctgc 1740 

cctgagggct tatggggagt caactgtagc aacacctgca cctgcaagaa tgggggcacc 1800 

tgtctccctg agaatggcaa ctgcgtgtgt gcacccggat tccggggccc ctcctgccag 1860 

agatcctgtc agcctggccg ctatggcaaa cgctgtgtgc cctgcaagtg cgctaaccac 1920 

tccttctgcc acccctcgaa cgggacctgc tactgcctgg ctggctggac aggccccgac 1980 

tgctcccagc catgccctcc aggacactgg ggagaaaact gtgcccagac ctgccaatgt 2040 

caccatggtg ggacctgcca tccccaggat gggagctgta tctgccccct aggctggact 2100 

ggacaccact gcttagaagg ctgccctctg gggacatttg gtgctaactg ctcccagcca 2160 

tgccagtgtg gtcctggaga aaagtgccac ccagagactg gggcctgtgt atgtccccca 222 0 

gggcacagtg gtgcaccttg caggattgga atccaggagc cctttactgt gafcgccgacc 228 0 

actccagtag cgtataactc gctgggtgca gtgattggca ttgcagtgct ggggtccctt 234 0 

gtggtagccc tggtggcact gttcatfcggc tatcggcact ggcaaaaagg caaggagcac 2400 

caccacctgg ctgtggctta cagcagcggg cgcctggacg gctccgagta tgtcatgcca 2460 

gatgtccctc cgagctacag tcactactac tccaacccca gctaccacac cctgtcgcag 252 0 

tgctccccaa accccccacc ccctaacaag gttccaggce cgctctttgc cagcctgcag 2580 

aaccctgagc ggccaggtgg ggcccaaggg catgataacc acaccaccct gcctgctgac 2640 

tggaagcacc gccgggagcc ccctccaggg cc tctggaca gggggagcag ccgcctggac 27 0 0 

cgaagctaca gctatagcta eagcaatggc ccaggcccat tctacgataa agggctcatc 2760 

tctgaagagg agctcggggc cagtgtggct tccctgagca gtgagaaccc atatgccacc 2 82 0 

atocgggacc tgcccagctt gccagggggc ccccgggaga gcagctacat ggagatgaaa 2 8 80 

ggccctccct caggatctgc ccccaggcag cctcctcagt tttgggacag ccagaggcgg 2 94 0 

cggcaacccc agccacagag agacagtggc aectacgagc agcccagccc cctgatccat 3 000 

gaccgagact ctgtgggctc ccagccccct ctgcctecgg gcctaccccc cggccactat 3 060 

gactcaccca agaacagcca catccctgga cattatgact tgcctccagt acggcatccc 3120 

ccatcacctc cacttcgacg ccaggaccgt 3150 

<210> 16 

<211> 2569 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (2)... (1492) 
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<400> 16 

g teg acc cac gcg tec ggt gac cct gtt cat gga cag tgc cga tgt cag 4 9 
Ser Thr His Ala Ser Gly Asp Pro Val His Gly Gin Cys Arg Cys Gin 

1 5 10 ~ 15 

get ggt tgg atg ggc aca cgc tgc cac ctg cct tgc ccg gag ggc ttt 97 

Ala Gly Trp Met Gly Thr Arg Cys His Leu Pro Cys Pro Gin Gly Phe 

20 25 30 

tgg gga gee aac tgc agt aac acc tgt acc tgc aag aat ggt ggt acc 145 

Trp Gly Ala Asn Cys Ser Asn Thr Cys Thr Cys Lys Asn Gly Gly Thr 

35 40 J 45 

tgt gtg tct gag aat ggc aac tgc gtg tgc gca cca ggg ttc ega ggc 193 

Cys Val Ser Glu Asn Gly Asn Cys Val Cys Ala Pro Gly Phe Arg Gly 

50 55 60 

ccc tec tgc cag agg ccc tgc ccg cct ggt cgc tat ggc aaa cgc tgt 241 

Pro Ser Cys Gin Arg Pro Cys Pro Pro Gly Arg Tyr Gly Lys Arg Cys 

65 70 75 " 80 

gtg caa tgc aag tgt aac aac aac cat tct tec tgc cac cca teg gac 28 9 

Val Gin Cys Lys Cys Asn Asn Asn His Ser Ser Cys His Pro Ser Asp 

85 90 95 

ggg acc tgc tec tgc ctg gcg ggc tgg aca ggc cct gac tgc tec gag 33 7 

Gly Thr Cys Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys Ser Glu 

100 105 110 

gca tgt ccc cca ggc cac tgg gga etc aaa tgc tec caa etc tgc cag 385 

Ala cys Pro Pro Gly His Trp Gly Leu Lys Cys Ser Gin Leu Cys Gin 

115 120 125 

tgt cat cat ggt ggg acc tgc cac ccc cag gat ggg age tgt ate tgc 433 

Cys His His Gly Gly Thr Cys His Pro Gin Asp Gly Ser Cys lie Cys 

130 135 140 

acg cca ggc tgg act gga ccc aac tgc ttg gaa ggc tgc cca cca aga 481 

Thr Pro Gly Trp Thr Gly Pro Asn Cys Leu Glu Gly Cys Pro Pro Arg 

145 150 155 160 

atg ttt ggt gtc aac tgc tec cag eta tgt cag tgt gat etc gga gag 529 

Met Phe Gly Val Asn Cys Ser Gin Leu Cys Gin Cys Asp Leu Gly Glu 

165 170 175 

atg tgc cac cca gag act ggg get tgt gtc tgt ccc cca gga cac agt 577 

Met Cys His Pro Glu Thr Gly Ala Cys Val Cys Pro Pro Gly His Ser 

180 185 190 

ggt gca gac tgc aaa atg gga age cag gag tee ttc acc ata atg ccc 625 

Gly Ala Asp Cys Lys Met Gly Ser Gin Glu Ser Phe Thr He Met Pro 

195 200 205 

acc tct Ccc gtg acc cat aac tea ctg ggt gca gtg att ggc act gca 673 

Thr Ser Pro Val Thr His Asn Ser Leu Gly Ala Val He Gly He Ala 

210 215 220 

gta ctg gga acc etc gtg gtg gee ctg ata gca ctg ttc att ggc tac 721 
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val Leu Gly Thr Leu val val Ala Leu He Ala Leu Phe He Giy Tyr 

225 230 235 240 

cgc cag tgg caa aag ggc aag gaa cat gag cac ttg gca gtg get tac 769 

Arg Gin Trp Gin Lys Gly Lys Glu His Glu His Leu Ala Val Ala Tyr 
245 250 255 

age act ggg egg ctg gat ggc tct gat tac gtc atg cca gat gtc tct 817 

Ser Thr Gly Arg Leu Asp Gly Ser Asp Tyr Val Met Pro Asp Val Ser 
260 265 270 

ccg age tat agt cac tac tac tec aac ccc age tac cac aca ctg tct 865 

Pro Ser Tyr Ser His Tyr Tyr Ser Asn Pro Ser Tyr His Thr Leu ser 
275 280 285 

cag tgt tct cct aac ccc ccg ccc cct aac aag gtc cca ggc agt cag 913 

Gin Cys Ser Pro Asn Pro Pro Pro Pro Asn Lys Val Pro Gly Ser Gin 
290 295 300 

etc ttt gtc age tct cag gec cct gag egg cca age aga gee cac ggg 961 

Leu Phe Val Ser Ser Gin Ala Pro Glu Arg Pro Ser Arg Ala His Gly 

305 310 315 320 

cgt gag aac cat acc aca ctg ccc get gac tgg aag cac cgc egg gag 1009 

Arg Glu Asn His Thr Thr Leu Pro Ala Asp Trp Lys His Arg Arg Glu 
325 330 335 

ccc cat gac aga ggc gee age cac ctg gac cga age tat age tgt age 1057 

Pro His Asp Arg Gly Ala Ser His Leu Asp Arg Ser Tyr Ser Cys Ser 
340 345 350 

tat age cac agg aat ggc cca gga cca ttc tgt cat aaa ggt ccc ate 1105 

Tyr Ser His Arg Asn Gly Pro Gly Pro Phe Cys His Lys Gly Pro He 
355 360 365 

tct gaa gag gga eta ggg gca age gtt atg tec ctg age agt gag aac 1153 

Ser Glu Glu Gly Leu Gly Ala Ser Val Met Ser" Leu Ser Ser Glu Asn 
370 375 380 

ccc tat get acc ate cga gac ctg ccc age ctg cct ggg gaa ccc cga 12 01 

Pro Tyr Ala Thr He Arg Asp Leu Pro Ser Leu Pro Gly Glu Pro Arg 

385 390 395 400 

gaa agt ggc tat gtg gag atg aaa gga cct cca tea gtg tec cct ccc 1249 

Glu Ser Gly Tyr Val Glu Met Lys Gly Pro Pro Ser Val Ser Pro Pro 
405 410 415 

agg cag tct ctt cat etc egg gac agg cag cag egg caa ctg cag cca 1297 

Arg Gin Ser Leu His Leu Arg Asp Arg Gin Gin Arg Gin Leu Gin Pro 
420 425 430 

cag agg gac age ggc acc tat gag cag ccc age ccc ttg age cat aat 1345 

Gin Arg Asp Ser Gly Thr Tyr Glu Gin Pro Ser Pro Leu Ser His Asn 
435 440 445 

gaa gag tct ttg ggc tec acg ccc ccg ctt cct cca ggc ctg cct cct 13 93 

Glu Glu Ser Leu Gly Ser Thr Pro Pro Leu Pro Pro Gly Leu Pro Pro 
450 455 • 460 
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ggt cac tac gac tec ccc aag aac age cat ate cct gga cac tat gac 1441 
Gly His Tyr Asp Ser Pro Lys Asn Ser His lie Pro Gly His Tyr Asp 
465 470 475 * 480 

ttg cct cca gta egg cat cct cca tec cct cca tec egg cgc cag gac 14 89 

Leu Pro Pro Val Arg His Pro Pro Ser Pro Pro Ser Arg Arg Gin Asp 
485 490 495 

cgc tgaagagccg gcatggtatg ggagcgtgcc tatgtacctt gecaggagea 1542 
Arg 

gggactggac cagcaggcca cgaacagaaa cacttggtga agtgaacaga gaeggactgt 1602 

ggccctgtgc ttccaccgag ggagacacta gttgacaaag tgtctaaccc tcttttccaa 1662 

cccactgctc aagtccctgt ggacataagc tggtgggcag aatgttgtfcg tacaagtgtg 1722 

attttagatc gatttttttt taaagtatgt gttgggtacc ttttctgtgt gtatgctcag 1782 

gcaggctgtg tgtgtctcta gttggcttta gagggagtca ggtataggtt ctgccttctg 1842 

cactttccat cttatctagt agtcagcttc caagcttaac tagttagagc tccaccagca 1902 

gcaggcccta actacctgcc tgccctteac ccagtaatcc tccatgtctt tgctcagagg 1962 

attgctcccc gactctggtg ttgtcctcct ggtaegcett gaeggtcctg cagtctccct 2 022 

ttcccgtctt gcttcattct ttcccagaat gaaggctgtc tgccacccta cttcccagcc 2 082 

caggaattgg cacatctaag ttcagccttc ctaagttacc cgttgagtcc tgettgeect 2142 

tcacatattc cacagaacac ccaccccaca tetgettcat agctactctc ttctccacgt 22 02 

aeccacagaa ggcagaagtg gtaccaggca agaagatggg attgttgcat tttgfctttgt 22 62 

ttttgagact ctgtctcact atgtagtcct ggctggcctg gaactcaaga gctctgcctg 2322 

cctctgcctc tfcgagtgctg ggtttaacgg ctcagggtca catgcacagc teaagctgea 2382 

ctccgatgtg ctttcccctg ttgetagatt agcgtctgcc tccccctagt ggagaggctg 2442 

atcgccagct etctgatgea ggactctggt gtttaggctc actcactatt ggtttccttg 2502 

gcacagggta gtcactcaat aaatgttcct ctaaaagctg aaaaaaaaaa aaaaaaaggg 2 562 

cggccgc . 2569 

<210> 17 
<211> 497 
<212> PRT 

<213> Mus musculus 
<400> 17 

Ser Thr His Ala Ser Gly Asp Pro Val His Gly Gin Cys Arg Cys Gin 

1,5 10 15 

Ala Gly Trp Met Gly Thr Arg Cys His Leu Pro Cys Pro Glu Gly Phe 

20 " 25 ^ 30 

Trp Gly Ala Asn Cys ser Asn Thr Cys Thr Cys Lys Asn Gly Gly Thr 

35 40 * " 45 

Cys Val Ser Glu Asn Gly Asn Cys Val Cys Ala Pro Gly Phe Arg Gly 

50 55 60 

Pro Ser Cys Gin Arg Pro Cys Pro Pro Gly Arg Tyr Gly Lys Arg Cys 
65 70 75 80 

Val Gin Cys Lys Cys Asn Asn Asn His Ser Ser Cys His Pro Ser Asp 

85 90 95 

Gly Thr Cys Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys Ser Glu 

100 105 110 

Ala Cys Pro Pro Gly His Trp Gly Leu Lys Cys Ser Gin Leu Cys Gin 

115 120 125 

Cys His His Gly Gly Thr Cys His Pro Gin Asp Gly Ser Cys He Cys 

130 135 140 

Thr Pro Gly Trp Thr Gly Pro Asn Cys Leu Glu Gly Cys Pro Pro Arg 
145 150 155 160 
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Met 


Phe 


Gly 


Val 


Asn 
165 


Cys 


Ser 


Gin 


Leu Cys 
170 


Gin 


Cys 


Asp 


Leu 


Gly 
175 


Glu 


Met 


Cys 


His 


Pro 


Glu 


Thr 


Gly Ala 


Cys Val 


Cys 


Pro 


Pro 


Gly 


His 


Ser 








180 










185 








190 






Gly 


Ala 


Asp 
195 


Cys 


Lys 


Met 


Gly 


Ser 
200 


Gin Glu 


Ser 


Phe 


Thr 
205 


He 


Met 


Pro 


Thr 


Ser 
210 


Pro 


Val 


Thr 


His 


Asn 
215 


Ser 


Leu Gly 


Ala 


Val 
220 


He 


Gly 


He 


Ala 


Val 


Leu 


Gly 


Thr 


Leu 


Val 


Val 


Ala 


Leu He 


Ala 


Leu 


Phe 


He 


Gly 


Tyr 


225 










230 








235 










240 


Arg 


Gin 


Trp 


Gin 


Lys 
245 


Gly 


Lys 


Glu 


His Glu 
250 


His 


Leu 


Ala 


Val 


Ala 
255 


Tyr 


Ser 


Thr 


Gly 


Arg 


Leu 


Asp 


Gly Ser 


Asp Tyr 


Val 


Met 


Pro 


Asp 


Val 


Ser 








260 










265 








270 






Pro 


Ser 


Tyr 


Ser 


His 


Tyr 


Tyr 


Ser 


Asn Pro 


Ser 


Tyr 


His 


Thr 


Leu 


Ser 


Gin 




275 










280 








285 








Cys 
290 


Ser 


Pro 


Asn 


Pro 


Pro 
295 


Pro 


Pro Asn 


Lys 


Val 
300 


Pro 


Gly 


Ser 


Gin 


Leu 


Phe 


Val 


Ser 


Ser 


Gin 


Ala 


Pro 


Glu Arg 


Pro 


Ser 


Arg 


Ala 


His 


Gly 


305 










310 








315 










320 


Arg 


Glu 


Asn 


His 


Thr 
325 


Thr 


Leu 


Pro 


Ala Asp 
330 


Trp 


Lys 


His 


Arg 


Arg 
335 


Glu 


Pro 


His 


Asp 


Arg 
340 


Gly 


Ala 


Ser 


His 


Leu Asp 
345 


Arg 


Ser 


Tyr 


Ser 
350 


Cys 


Ser 


Tyr 


Ser 


His 
355 


Arg 


Asn 


Gly 


Pro 


Gly 
360 


Pro Phe 


Cys 


His 


Lys 
365 


Gly 


Pro 


He 


Ser 


Glu 
370 


Glu 


Gly 


Leu 


Gly 


Ala 
375 


Ser 


Val Met 


Ser 


Leu 
380 


Ser 


Ser 


Glu 


Asn 


Pro 


Tyr 


Ala 


Thr 


He 


Arg 


Asp 


Leu 


Pro Ser 


Leu 


Pro 


Gly 


Glu 


Pro 


Arg 


385 










330 








395 










400 


Glu 


Ser 


Gly 


Tyr 


Val 
405 


Glu 


Met 


Lys 


Gly Pro 
410 


Pro 


Ser 


Val 


Ser 


Pro 
415 


Pro 


Arg 


Gin 


Ser 


Leu 


His 


Leu 


Arg Asp 


Arg Gin 


Gin 


Arg 


Gin 


Leu 


Gin 


Pro 








420 










425 








430 






Gin 


Arg 


Asp 
435 


Ser 


Gly 


Thr 


Tyr 


Glu 
44 0 


Gin Pro 


Ser 


Pro 


Leu 
445 


Ser 


His 


Asn 


Glu 


Glu 
450 


Ser 


Leu 


Gly 


Ser 


Thr 
455 


Pro 


Pro Leu 


Pro 


Pro 

460 


Gly 


Leu 


Pro 


Pro 


Gly 


His 


Tyr 


Asp 


Ser 


Pro 


Lys 


Asn 


Ser His 


He 


Pro 


Gly 


His 


Tyr 


Asp 


4 65 










470 








475 










480 


Leu 


Pro 


Pro 


Val 


Arg 
485 


His 


Pro 


Pro 


Ser Pro 
490 


Pro 


Ser 


Arg 


Arg 


Gin 
495 


Asp 



Arg 



<210> 18 

<211> 1491 

<212> DNA 

<213> Mus musculus 

<400> 18 

tcgacccacg cgtccggtga ccctgttcat ggacagtgcc 

ggcacacgct gccacctgcc ttgcccggag ggcttttggg 

tgtacctgca agaatggtgg tacctgtgtg tctgagaatg 

gggttccgag gcccctcctg ccagaggccc tgcccgcctg 

gtgcaatgca agtgtaacaa caaccattct tcctgccacc 

tgcctggcgg gctggacagg ccctgactgc tccgaggcat 

cfccaaatgct cccaactctg ccagtgtcat catggtggga 
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gatgtcaggc tggttggatg 60 

gagccaactg cagtaacacc 120 

gcaactgcgt gtgcgcacca 180 

gtcgctatgg caaacgctgt 240 

catcggacgg gacctgctcc 300 

gtcccccagg ccactgggga 3 60 

cctgccaccc ccaggatggg 420 
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agctgtatct gcacgccagg ctggactgga cccaactgct tggaaggctg cccaccaaga 4 80 

atgtttggtg tcaactgctc ccagctatgt cagtgtgatc tcggagagat gtgccaccca 54 0 

gagactgggg cttgtgtctg tcccccagga cacagtggtg cagactgcaa aatgggaagc 600 

caggagtcct tcaccataat gcccacctct cccgtgaccc ataactcact gggtgcagtg 660 

attggcattg cagtactggg aaccctcgtg gtggccctga tagcactgtt cattggctac 720 

cgccagtggc aaaagggcaa ggaacatgag cacttggcag tggcttacag cactgggegg 7 80 

ctggatggct ctgattacgt catgccagat gtctctccga gctatagtca ctactactcc 840 

aaccccagct accacacact gtctcagtgt tctcctaacc ccccgccccc taacaaggtc 900 

ccaggcagtc agctctttgt cagctctcag gcccctgagc ggccaagcag agcccacggg 960 

cgtgagaacc ataccacact gcccgctgac tggaagcacc gccgggagcc ccatgacaga 102 0 

ggcgccagcc acctggaccg aagctatagc tgtagctata gccacaggaa tggcccagga 1080 

ccattctgtc ataaaggtcc catctctgaa gagggactag gggcaagcgt tatgtccctg 114 0 

agcagtgaga acccctatgc taccatccga gacctgccca gcctgcctgg ggaaccccga 12 00 

gaaagtggct atgtggagat gaaaggacct ccatcagtgt ccccteccag gcagtctctt 12 60 

catctccggg acaggcagca gcggcaactg cagccacaga gggacagcgg cacctatgag 1320 

cagcccagcc ccttgagcca taatgaagag tctttgggct ccacgccccc gattcctcca 13 80 

ggcctgcctc ctggtcacta cgactccccc aagaacagcc atatccctgg acactatgac 1440 

ttgcctccag tacggcatcc tccatcccct ccatcccggc gccaggaccg c 1491 

<210> 19 

<211> 3567 

<212> DNA 

<213> Rauttus sp. 

<220> 
<221> CDS 

<222> (925) . . . (2832) 
<400> 19 

gtccgaccca cgcgtccgag ccacaccctg aaggtggttg gaaggaggga aggatctagg 60 

tcctgagcac tggaattccc cagaacagca tctggcttcc cagacccatg ctggccacca 12 0 

ctgatgtgtc cttccggctg ctggctgcag tgctgttctg ttgttgggtg ccctgtggca 18 0 

ggcttgtgca atgccactct gtcccctcct cctdctggcc ctaggcctgc gtctggctgg 24 0 

aacactcaac tccaatgatc ccaatgtctg taccttctgg gaaagcttca ccacgaccac 300 

taaggagtcc caccttcgcc ccttcagcct gcccccagcc gagtcctgcg acaggccctg .3 60 

ggaagac.ccc cacacctgcg ctcagcctac ggttgtctac cggactgtgt accgtcaggt 42 0 

ggtgaagatg gactcccgcc cacgcctgca gtgctgtggg ggttactacg agagcagtgg 48 0 

agcctgtgtc ccactctgtg cccaggagtg tgtccacggt cgctgtgtgg ctccfcaatcg 54 0 

gtgccagtgt gcaccaggct ggcggggtga cgactgttcc agtgagtgtg ctcctggaat 60 0 

gtggggacca cagtgtgaca ggctctgcct ctgtggcaac agcagttcct gtgatcccag 660 

gagtggggtg tgtttttgcc cctctggcct gcagcccccc gactgccttc agccttgccc 720 

cgatggccac tatggtcctg cctgccagtt tgattgccat tgctatgggg catcctgtga 780 

cccccgggat ggagcctgct tctgcccccc agggagaaca ggacccaggg cactgatggc 84 0 

ttcttctgcc ccagaactta tccttgccaa aatggaggtg ttcctcaggg ctctcaaggc 900 

tcctgcagct gcccaccggg ctgg atg ggt gtc ate tgt tec ctg cca tgc 951 

Met Gly Val He Cys ser Leu Pro Cys 
1 5 

cca gag ggt ttc cac gga ccc aac tgt act cag gaa tgt cgt tgc cac 999 
Pro Glu Qly Phe His Gly Pro Asn Cys Thr Gin Glu Cys Arg Cys His 
10 15 20 25 

aat ggt ggc ctt tgt gac agg ttt act ggg cag tgc cac tgt get cct 1047 
Asn Gly Gly Leu Cys Asp Arg Phe Thr Gly Gin Cys His Cys Ala Pro 
30 35 40 

ggc tat ate ggg gat egg tgc cgt gaa gag tgc cct gtg ggc cgc ttc 1095 
Gly Tyr He Gly Asp Arg Cys Arg Glu Glu Cys Pro Val Gly Arg Phe 
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45 50 55 

ggt caa gac tgt get gag acc tgt gac tgt get cct ggc get egt tgc 114 3 
Gly Gin Asp Cys Ala Glu Thr Cys Asp Cys Ala Pro Gly Ala Arg Cys 
60. 65 70 

ttt cct gec aat ggc gcg tgt ctg tgc gaa cat ggc ttc aca ggc gac 1191 
Phe Pro Ala Asn Gly Ala Cys Leu Cys Glu His Gly Phe Thr Gly Asp 
75 80 85 

cgc tgc act gag cga etc tgt cca gat ggc cgc tat ggt ctg age tgc 123 9 

Arg Cys Thr Glu Arg Leu Cys Pro Asp Gly Arg Tyr Gly Leu Ser Cys 
90 95 100 105 

caa gat ccc tgc acc tgc gac cca gaa cac agt etc age tgc cac cca 1287 
Gin Asp Pro Cys Thr Cys Asp Pro Glu His Ser Leu Ser Cys His Pro 
110 115 ^ 120 

atg cac ggc gag tgc tec tgc cag cca ggt tgg gcg ggc etc cac tgc 13 35 

Met His Gly Glu Cys Ser Cys Gin Pro Gly Trp Ala Gly Leu His Cys 
125 130 135 

aac gag age tgc cct cag gac acg cac gga gee ggt tgc cag gag cac 13 83 

Asn Glu Ser Cys Pro Gin Asp Thr His Gly Ala Gly Cys Gin Glu His 
140 145 150 

tgc etc tgt ctg cac ggc ggt gtt tgc etc gee gac age ggc etc tgc 1431 
Cys Leu Cys Leu His Gly Gly Val Cys Leu Ala Asp Ser Gly Leu Cys 
155 160 165 

egg tgt gca cct ggc tac acg gga cct cac tgc get aat ctt tgt cca 1479 
Arg Cys Ala. Pro Gly Tyr. Thr Gly Pro His Cys Ala Asn Leu Cys Pro 
170 175 180 185 

cct aac act tat ggg ate aac tgt tec tec cac tgc tec tgt gaa aat 1527 
Pro Asn Thr Tyr Gly lie Asn Cys ser Ser His Cys Ser Cys Glu Asn 
190 195 200 

gec att gee tgc tct cct gtc gac ggc acg tgc ate tgc aag gaa ggt 1575 
Ala lie Ala Cys Ser Pro Val Asp Gly Thr Cys He Cys Lys Glu Gly 
205 210 215 

tgg cag cgt ggt aac tgc tct gtg ccc tgt ccc cct ggc acc tgg ggc 162 3 

Trp Gin Arg Gly Asn Cys Ser Val Pro Cys Pro Pro Gly Thr Trp Gly 
220 225 230 

ttc agt tgc aat gee agt tgc cag tgt gee cac gag gga gtc tgc age 1671 
Phe Ser Cys Asn Ala Ser Cys Gin Cys Ala His Glu Gly Val Cys Ser 
235 240 245 

ccc caa act gga gec tgt act tgc acc cct ggg tgg cgt ggg gtt cac 1719 
Pro Gin Thr Gly Ala Cys Thr Cys Thr Pro Gly Trp Arg Gly Val His 
250 255 260 265 

tgc caa ctt ccg tgc ccg aag gga cag ttt ggt gaa ggt tgt gec agt 1767 
Cys Gin Leu Pro Cys Pro Lys Gly Gin Phe Gly Glu Gly Cys Ala Ser 
270 275 ^ 280 



27 



BNSDOCID: <WO 0100673A1_IB> 



gtc tgt gac tgt gac cac tec gat ggc tgt gac cct gtt cat gga cac mi5 

Val Cys Asp Cys Asp His Ser Asp Gly Cys Asp Pro Val His Qly His 
285 290 295 

tgc cga tgt cag get ggc tgg atg ggc aca cgt tgc cac ctg cct tgc 1863 

Cys Arg Cys Gin Ala Gly Trp Met Gly Thr Arg Cys His Leu Pro Cys 

300 305 310 

cca gag ggc ttt tgg gga gec aac tgc age aat gec tgt acc tgc aag 1911 

Pro Glu Gly Phe Trp Gly Ala Asn Cys Ser Asn Ala Cys Thr Cys Lys 
315 320 325 

aat ggt ggc act tgt gta cct gag aac ggc aac tgt gtg tgc gca cca 1959 

Asn Gly Gly Thr Cys Val Pro Glu Asn Gly Asn Cys Val Cys Ala Pro 
330 335 340 345 

ggg ttc aga ggc ccc tec tgc cag agg ccc tgc ccg cct ggt cgc tat 2007 

Gly Phe Arg Gly Pro Ser Cys Gin Arg Pro Cys Pro Pro Gly Arg Tyr 

350 355 360 

ggc aaa cgc tgt gtg ccc tgc aag tgc aac aac cat tct tec tgc cac 2 055 

Gly Lys Arg Cys Val Pro Cys Lys Cys Asn Asn His Ser Ser Cys His 
365 370 375 

ccg teg gat ggg acc tgc tec tgc ctg gca ggc tgg aca ggc cct gac 2103 

Pro Ser Asp Gly Thr Cys Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp 

380 385 390 

tgc tct gaa tea tgt ccc cca ggc cac tgg gga etc aaa tgc tec caa 2151 

Cys Ser Glu Ser Cys Pro Pro Gly His Trp Gly Leu Lys Cys Ser Gin 
395 400 405 

ccc tgc cag tgt cat cat ggt gec acc tgc cac ccc cag gat ggg age 2199 

Pro Cys Gin Cys His His Gly Ala Thr Cys His Pro Gin Asp Gly Ser 
410 415 420 425 

tgt gtc tgc ate cca ggc tgg act gga ccc aac tgc teg gaa ggc tgc 224 7 

Cys Val Cys lie Pro Gly Trp Thr Gly Pro Asn Cys Ser Glu Gly Cys 

430 435 440 

cca tea aga atg ttt ggt gtc aac tgc tec cag eta tgt cag tgt gat 22 95 

Pro Ser Arg Met Phe Gly Val Asn Cys Ser Gin Leu Cys Gin Cys Asp 
445 450 455 

cct gga gag atg tgc cac cca gag act ggg get tgc gtc tgt ccc cca 2343 

Pro Gly Glu Met Cys His Pro Glu Thr Gly Ala Cys Val Cys Pro Pro 

460 465 470 

gga cac agt ggt gcg cac tgc aaa gtg ggc age cag gag tec ttc acc 23 91 

Gly His Ser Gly Ala His Cys Lys Val Gly Ser Gin Glu Ser Phe Thr 
475 480 485 

ata atg ccc acc tct cct gtg ate cat aac tea ctg ggt gec gtg att 243 9 

He Met Pro Thr Ser Pro Val He His Asn Ser Leu Gly Ala Val He 
490 495 500 505 

ggc att gca gtg ctg ggg acc ctt gtg gtg gec ctg gta gca ctg ttt 24 87 

Gly He Ala Val Leu Gly Thr Leu Val Val Ala Leu Val Ala Leu Phe 
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510 515 520 

att ggc tac cga cac tgg caa aag ggc aag gaa cat gag cac ttg gca 2535 

He Gly Tyr Arg His Trp Gin Lys Gly Lys Glu His Glu His Leu Ala 

525 530 535 

gtg get tac age act ggg cga ctg gat ggc tec gat tac gtc atg cca 25 83 

Val Ala Tyr Ser Thr Gly Arg Leu Asp Gly Ser Asp Tyr Val Met Pro 

540 545 ~ 550 

gat gtc tct ccg age tac agt cac tac tat tec aac cct age tac cac 2631 

Asp Val Ser Pro Ser Tyr Ser His Tyr Tyr Ser Asn Pro Ser Tyr His 

555 560 565 

aca ctg tct cag tgt tct cct aac cct cca ccc cct aac aag att cca 2679 

Thr Leu Ser Gin Cys Ser Pro Asn Pro Pro Pro Pro Asn Lys lie Pro 

570 ' 575 580 585 

ggc agt cag ctg ttt gtc age tec cag gca tct gag egg cca aac aga 2727 

Gly Ser Gin Leu Phe Val Ser Ser Gin Ala Ser Glu Arg pro Asn Arg 

590 595 600 

aac cat ggg cga gat aac eac gec aca ctg ceo get gac tgg aag cac 2775 

Asn His Gly Arg Asp Asn Hie Ala Thr Leu Pro Ala Asp Trp Lys His 

605 610 615 

cga egg gag tec cat gac aga get ttc etc agg cac cag cca cct gga 2823 

Arg Arg Glu Ser His Asp Arg Ala Phe Leu Arg His Gin Pro Pro Gly 

620 625 _ 630 

ccg aag gta tagctgtagc tatggecaca ggaatggccc ggggecatte 2 872 
Pro Lys Val 
635 

tgtcataaag gtcccatctc tgaagaagga ctaggggcaa gcgtfcatgtc cctgagcagt 2932 

gagaacccct atgegaccat ccgagacctg cccggcctgc ctggggaacc ocgagaaagc 2992 

agctatgtgg agatgaaagg ccctccatca gtgtctcccc ccaggcagcc tcttcatctc 3052 

egggacagge agcagcagca actgeagtet cagagagaca gcggcaccta tgagcagccc 3112 

actcccttga geegtaatga agagtctgtg ggctccatgc ccccfccttcc tccgggcctg 3172 

ccacccggcc actatgactc gcccaaaaac agccacatcc ctggacacta tgacttgect 323 2 

ccagtacggc atcctccatc acctccatcc cggcgccagg acegctgagg agceagcatg 3292 

gtatgggaga gtgcctgtga accctgccag gagcagggee tggaccagca ggccatgaat 3 352 

agacatactt ggtgaagtga aeggagactg aggatggctc tgcttccacc gagggagaca 3412 

ctagttggca aagtgtctaa cctccctttt ccagcccatt gctcaagtcc cccaggctgt 3472 

ggacatgagc tggtgggcag aatgttgttg ttgaagtctg atttfcagatt gattttttaa 3 532 

aaaaaaaaaa aaaaaaaaaa aaaaagggcg geege 3567 

<210> 20 

<211> 636 

<212> PRT 

<213> Rauttus sp. 

<400> 20 

Met Gly Val lie Cys Ser Leu Pro Cys Pro Glu Gly Phe His Gly Pro 

1 5 10 15 
Asn Cys Thr Gin Glu Cys Arg Cys His Asn Gly Gly Leu cys Asp Arg 

20 25 30 

Phe Thr Gly Gin Cys His Cys Ala Pro Gly Tyr lie Gly Asp Arg Cys 
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35 




40 






45 








Arg 




Glu 


Cys 


Pro Val Gly Arg Phe Gly 


Gin Asp Cys 


Ala 


Glu 


Thr 




50 






55 




60 










Cys 


Asp 


Cys 


Ala 


Pro Gly Ala Arg Cys Phe 


Pro 


Ala 


Asn 


Gly 


Ala 


Cys 


65 








70 


75 










80 


Leu 


Cys 


Glu 


His 


Gly Phe Thr Gly Asp Arg 


Cys 


Thr 


Glu 


Arg 


Leu 


Cys 










85 90 










95 




Pro 


Asp 


Gly Arg Tyr Gly Leu Ser Cys Gin 


Asp 


Pro 


Cys 


Thr 


Cys 


Asp 








100 


105 








110 






PlTO 


VjJ.ll 


His 


Ser 


Leu Ser Cys His Pro Met 


His 


Gly Glu 


Cys 


Ser 


CyB 






115 




120 






125 










PlTO 


Gly Trp Ala Gly Leu His Cys Asn 


Glu 


Ser 


Cys 


Pro 


Gin 


Asp 




130 






135 




140 










Thr 


His 


Gly Ala Gly Cys Gin Glu Hie Cys 


Leu 


Cys 


Leu 


His 


Gly 


Gly 


145 








150 


155 










160 


Val 


Cys 


Leu 


Ala 


Asp Ser Gly Leu Cys Arg 


Cys 


Ala 


Pro 


Gly 


Tyr 


Thr 










165 ~ 170 










175 




Gly 


Pro 


His 


Cys 


Ala Asn Leu Cys Pro Pro 


Asn 


Thr 


Tyr 


Gly 


He 


Asn 








180 


185 








190 






Cys 


Ser 


Ser 


His 


Cys Ser Cys Glu Asn Ala 


He 


Ala 


Cys 


Ser 


Pro 


Val 






195 




200 






205 








Asp 


Gly 


Thr 


Cys 


He Cys Lys Glu Gly Trp 


Gin Arg Gly 


Asn 


Cys 


Ser 




210 






215 




220 










Val 


Pro 


Cys 


Pro 


Pro Gly Thr Trp Gly Phe 


Ser 


Cys 


Asn 


Ala 


Ser 


Cys 


225 








230 


235 










240 


Gin 


Cys 


Ala 


His 


Glu Gly Val Cys ser Pro 


Gin Thr Gly 


Ala 


Cys 


Thr 










245 250 










255 




Cys 


Thr 


Pro 


Gly Trp Arg Gly Val His Cys 


Gin 


Leu 


Pro 


Cys 


Pro 


Lys 








260 


265 








270 






Gly 


Gin 


Phe 


Gly Glu Gly Cys Ala Ser Val 


Cys 


Asp 


Cys 


Asp 


His 


Ser 






275 




280 






285 








Asp 


Gly 


Cys Asp Pro Val His Gly His Cys 


Arg 


Cys 


Gin 


Ala 


Gly 


Trp 




290 






295 




300 










Met 


Gly 


Thr 


Arg 


Cys His Leu Pro Cys Pro 


Glu 


Gly 


Phe 


Trp 


Gly 


Ala 


3 05 








310 


315 










320 


Asn 


Cys 


Ser 


Asn 


Ala Cys Thr Cys Lys Asn 


Gly Gly Thr 


Cys 


Val 


Pro 










325 3 30 










335 




Glu 


Asn 


Gly Asn 


Cys Val Cys Ala Pro Gly 


Phe 


Arg 


Gly 


Pro 


Ser 


Cys 








340 


345 








350 






Gin 


Arg 


Pro 


Cys 


Pro Pro Gly Arg Tyr Gly 


Lys 


Arg 


Cys 


Val 


Pro 


Cys 






355 




360 






365 








Lys 


Cys 


Asn 


Asn 


His Ser Ser Cys His Pro 


Ser 


Asp 


Gly 


Thr 


Cys 


Ser 




370 






375 




380 










Cys 


Leu 


Ala 


Gly Trp Thr Gly Pro Asp Cys 


Ser 


Glu 


Ser 


Cys 


Pro 


Pro 


385 








390 


3 95 










400 


Gly 


His 


Trp 


Gly Leu Lys Cys Ser Gin Pro 


Cys 


Gin 


Cys 


His 


His 


Gly 










405 410 










415 




Ala 


Thr 


Cys 


His 


Pro Gin Asp Gly Ser Cys 


Val 


Cys 


He 


Pro 


Gly 


Trp 








420 


425 








430 






Thr 


Gly 


Pro 


Asn 


Cys Ser Glu Gly Cys Pro 


Ser 


Arg Met 


Phe 


Gly 


Val 






435 




440 






445 








Asn 


Cys 


Ser 


Gin 


Leu Cys Gin Cys Asp Pro 


Gly 


Glu 


Met 


Cys 


His 


Pro 




450 






455 




460 










Glu 


Thr 


Gly Ala 


Cys Val Cys Pro Pro Gly 


His 


Ser Gly 


Ala 


His 


Cys 


465 








470 


475 










480 


Lys 


Val 


Gly 


Ser 


Gin Gin Ser Phe Thr He 


Met 


Pro 


Thr 


Ser 


Pro 


Val 










485 490 










495 




He 


His 


Asn 


Ser 


Leu Gly Ala Val lie Gly 


He 


Ala 


Val 


Leu 


Gly 


Thr 
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Leu Val Val Ala Leu Val Ala Leu Phe He Gly Tyr Arg His Trp Gin 

515 520 525 

Lys Gly Lye Glu His Glu His Leu Ala Val Ala Tyr Ser Thr Gly Arg 

530 535 540 

Leu Asp Gly Ser Asp Tyr Val Met Pro Asp Val Ser Pro Ser Tyr Ser 
545 . 550 * 555 560 

His Tyr Tyr Ser Asn Pro Ser Tyr. His Thr Leu Ser Gin Cys Ser Pro 

565 570 575 

Asn Pro Pro Pro Pro Asn Lys He Pro Gly Ser Gin Leu Phe Val Ser 

580 585 590 

Ser Gin Ala Ser Glu Arg Pro Asn Arg Asn His Gly Arg Asp Asn His 

595 600 605 

Ala Thr Leu Pro Ala Asp Trp Lys His Arg Arg Glu Ser His Asp Arg 

610 * 615 620 

Ala Phe Leu Arg His Gin Pro Pro Gly Pro Lys Val 
625 *" 630 635 

<210> 21 
<211> 1908 
<212> E>NA 
<213> Rattus Sp. 

<400> 21 

atgggtgtca tctgttccct gccatgccca gagggtttcc acggacccaa ctgtactcag 60 
gaatgtcgtt gccacaatgg tggcctttgt gacaggttta ctgggcagtg ccactgtgct 120 
cctggctata tcggggatcg gtgccgtgaa gagtgccctg tgggccgctt cggtcaagac 180 
tgtgctgaga cctgtgactg tgctcctggc gctcgttgct ttcctgccaa tggcgcgtgt 240 
ctgtgcgaac atggcttcac aggcgaccgc tgcactgagc gactctgtcc agatggccgc 3 00 

tatggtctga gdtgccaaga tccctgcacc tgcgacccag aacacagtct cagctgccac 3 60 

ccaatgcacg gcgagtgctc ctgccagcca ggttgggcgg gcctccactg caaogagagc 420 
tgccctcagg acacgcacgg agccggttgc caggagcact gcctctgtct gcacggcggt 480 
gtttgcctcg ccgacagcgg cctctgccgg tgtgcacctg gctacacggg acctcactgc 540 
gctaatcttt gtccacctaa cacttatggg atcaactgtt cctcccactg ctcctgtgaa 600 
aatgccattg cctgctctcc tgtcgacggc acgtgcatct gcaaggaagg ttggcagcgt 660 
ggtaactgct ctgtgecctg tccccctggc acctggggct tcagttgcaa tgccagttgc 720 
cagtgtgccc acgagggagb ctgcagcccc caaactggag cotgtacttg cacccctggg 780 
tggcgtgggg ttcactgcca acttccgtgc ccgaagggac agtttggtga aggttgtgcc 840 
agtgtctgtg actgtgacca ctccgatggc tgtgaccctg ttcatggaca ctgccgatgt 900 
caggctggct ggatgggcac acgttgccac ctgccttgcc cagagggcfct ttggggagcc 960 

aactgcagca atgcctgtac ctgcaagaat ggtggcactt gtgtacctga gaacggcaac 1020 

tgtgtgtgcg caccagggtt cagaggcccc tcctgccaga ggccctgccc gcctggtcgc 10 80 

tatggcaaac gctgtgtgcc ctgcaagtgc aacaaccatt cttcctgcca cccgtcggat 1140 

gggacctgct cctgcctggc aggctggaca ggccctgact gctctgaatc atgtccccca 12 00 

ggccactggg gactcaaatg ctcccaaccc tgccagtg;tc atcatggtgc cacctgccac 12 60 

ccccaggatg ggagctgtgt ctgcatccca ggctggactg gacccaactg ctcggaaggc 1320 

tgcccatcaa gaatgtttgg tgtcaactgo tcccagctat gtcagtgtga tcctggagag 13 80 

atgtgccacc cagagactgg ggcttgcgtc tgtcccccag gacacagtgg tgcgcactgc 1440 

aaagtgggca gccaggagtc cttcaccata atgcccacct ctcctgtgat ccataactca 1500 

ctgggtgccg tgattggcat tgcagtgctg gggacccttg tggtggccct ggtagcactg 1560 

tttatfcggct accgacactg gcaaaagggc aaggaaeatg agcacttggc agtggcttac 1620 

agcactgggc gaotggatgg ctccgattac gtcatgccag atgtctctcc gagctacagt 1680 

cactactatt ccaaccctag ctaccacaca ctgtctcagt gttctcctaa ccctccaccc 1740 

ectaacaaga ttccaggcag tcagctgttt gtcagctcce aggcatctga gcggccaaac 1800 

agaaaccatg ggcgagataa ccacgccaca ctgcccgctg actggaagca ccgacgggag. 1860 

tcccatgaca gagctttcct caggcaccag ccacctggac cgaaggta 1908 

<210> 22 
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<2Z1> 1497 

<212> OTA 

<213> Homo sajpiens 

<220> 
<:221> CDS 

<222> (217) . . . (684) 
<400> 22 

gtcgacccac gcgtccggct cccagcccac ccccaaacag acacagcgta gcccgggcca 60 
gctcttaagg agttcaggag tgagaagagg ccctcagaga tctgacagcc taggagtgcg 12 0 
tggacaccac ctcagcccac tgagcaggag tcacagcacg aagaccaagc gcaaagcgac 180 
ccctgccctc catcctgact gcfccctccta agagag atg gca ccg gcc aga gca 234 

Met Ala Pro Ala Arg Ala 
1 5 

gga ttc tgc ccc ctt ctg ctg ctt ctg ctg ctg ggg ctg tgg gtg gca 2 82 

Gly Phe Cys Pro Leu Leu Leu Leu Leu Leu Leu Gly Leu Trp Val Ala 
10 15 20 

gag ate cca gtc agt gcc aag ccc aag ggc atg acc tea tea cag tgg 330 
Glu He Pro Val Ser Ala Lys Pro Lys Gly Met Thr Ser Ser Gin Trp 
25 30 35 

ttt aaa att cag cac atg cag ccc age cct caa gca tgc aac tea gcc 378 
Phe Lys lie Gin His Met Gin Pro Ser Pro Gin Ala Cys Asn Ser Ala 
40 45 50 

atg aaa aac att aac aag cac aca aaa egg tgc aaa gac etc aac acc 426 
Met Lys Asn He Asn Lys His Thr Lys Arg Cys Lys Asp Leu Asn Thr 
55 60 65 70 

ttc ctg cac gag cct ttc tec agt gtg gcc gcc acc tgc cag acc ccc 474 
Phe Leu His Glu Pro Phe Ser Ser Val Ala Ala Thr Cys Gin Thr Pro 
75 80 85 

aaa ata gcc tgc aag aat ggc gat aaa aac tgc cac cag age cac ggg 522 
Lys lie Ala Cys Lys Asn Gly Asp Lys Asn Cys His Gin Ser His Gly 
90 95 100 

ccc gtg tec ctg acc atg tgt aag etc acc tea ggg aag tat ccg aac 570 
Pro Val Ser Leu Thr Met Cys Lys Leu Thr Ser Gly Lys Tyr Pro Asn 
105 110 115 

tgc agg tac aaa gag aag cga cag aac aag tct tac gta gtg gcc tgt 618 
Cys Arg Tyr Lys Glu Lys Arg Gin Asn Lys Ser Tyr Val Val Ala Cys 
120 125 130 

aag cct ccc cag aaa aag gac tct cag caa ttc cac ctg gtt cct gta 666 
Lys Pro Pro Gin Lys Lys Asp Ser Gin Gin Phe His Leu Val Pro Val 
135 140 145 150 

cac tfcg gac aga gtc ctt taggtttcca gactggcttg ctctttggct 714 
His Leu Asp Arg Val Leu 
155 

gaccttcaat tccctctcca ggactccgca ccactcecct acacccagag cattct'cttc 774 
ccctcatctc ttggggctgt tcctggttca gcctctgctg ggaggctgaa gctgacactc 834 
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tggtgagctg 
ttatccccaa 
gtctcccctg 
ggcatatggg 
ctgacgtggc 
ttgtatagaa 
atggatgagg 
tagtagcaga 
ctctaccagt 
agttcactca 
aaggtcatta 
aaa 



agctctagag 
gaaacagcaa 
ccccctggca 
atttgtggac 
agtgaggtga 
tcctctaatc 
aaattaaggt 
acctggactt 
tgcgcaagaa 
tgaagaaacg 
cctctctagc 



ggatggcttt 

gctcaggtct 
ttagggcagc 
acagctgttt 
cctgaaggaa 
ccttgtgaca 
tttagaaagc 
gaacctaggt 
agaagtcact 
agtgctctga 
caaaaaaaaa 



tcatcttttt 
gtgggttccc 
atgacaagga 
ctgttcctga 
agaaaaatat 
tagacttgac 
ttaatgaatt 
ctccttgctc 
gttacagagg 
agagccagtt 
aaaaaaaaaa 



gttgctgttt 
tggtctatgc 
gaggaaataa 
actagaagtc 
aaataaatac 
agggattgta 
aaagagcttg 
taaatacagt 
caagcggtga 
accctgtgtt 
aaaaaaaaaa 



tcccagatgc 
cattgcacat 
atggaaaggg 
ttccccagct 
cacttcatat 
tgccttcttt 
tctaattagt 
gtaccttcta 
actaggtaag 
ggctgcaata 
aaaaaaaaaa 



894 
954 
1014 
1074 
1134 
1194 
1254 
1314 
13 74 
1434 
1494 
1497 





<210> 


23 
























<211> 


156 
























<212> 


PRT 
























<213> 


Homo sapiens 


















<400> 


23 






















Met 


Ala 


Pro 


Ala 


Arg 


Ala 


Gly 


Phe Cys Pro 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


1 








5 






10 










15 




Leu 


Gly 


Leu 


Trp 


Val 


Ala 


Glu 


lie Pro Val 


Ser 


Ala 


Lys 


Pro 


Lys 


Gly 








20 








25 








30 






Met 


Thr 


Ser 


Ser 


Gin 


Trp 


Phe 


Lys lie Gin 


His 


Met 


Gin 


Pro 


Ser 


Pro 






35 










40 






45 








Gin 


Ala 


Cys 


Asn 


Ser 


Ala 


Met 


Lys Asn lie 


Asn 


Lys 


His 


Thr 


Lys 


Arg 




50 










55 






60 










Cys 


Lys 


Asp 


Leu 


Asn 


Thr 


Phe 


Leu His Glu 


Pro 


Phe 


Ser 


Ser 


Val 


Ala 


65 










70 






75 










80 


Ala 


Thr 


Cys 


Gin 


Thr 


Pro 


Lys 


lie Ala Cys 


Lys 


Asn 


Gly 


Asp 


Lys 


Asn 










85 






90 










95 




Cys 


His 


Gin 


Ser 


His 


Gly 


Pro 


Val Ser Leu 


Thr 


Met 


Cys 


Lys 


Leu 


Thr 








100 








105 








110 






Ser 


Gly 


Lys 


Tyr 


Pro 


Asn 


Cys 


Arg Tyr Lys 


Glu 


Lys 


Arg 


Gin 


Asn 


Lys 






115 










12 0 






125 








Ser 


Tyr 


Val 


val 


Ala 


Cys 


Lys 


Pro Pro Gin Lys 


Lys 


Asp 


Ser 


Gin 


Gin 




130 










135 






14 0 










Phe 


His 


Leu 


val 


Pro 


Val 


His 


Leu Asp Arg Val 


Leu 










145 










150 






155 













<210> 
<211> 
<212> 
<213> 



24 

468 

DtfA 

Homo sapiens 



<400> 24 

atggcaccgg ccagagcagg attctgcccc cttctgctgc ttctgctgct ggggctgtgg 60 

gtggcagaga tcccagtcag tgccaagccc aagggcatga cctcatcaca gtggtttaaa 120 

attcagcaca tgcagcccag ccctcaagca tgcaactcag ccatgaaaaa cattaacaag 180 

cacacaaaac ggtgcaaaga cctcaacacc ttcctgcacg agcctttctc cagtgtggcc 240 

gccacctgcc agacccccaa aatagcctgc aagaatggcg ataaaaactg ccaccagagc 300 

cacgggcccg tgtccctgac catgtgtaag ctcacctcag ggaagtatcc gaactgcagg 360 

tacaaagaga agcgacagaa caagtcttac gtagtggcct gtaagcctcc ccagaaaaag 42 0 

gactctcagc aattccacct ggttcctgta cacttggaca gagtcctt 468 

<210> 25 
<211> 1788 
<212> DNA 
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<213> Homo sapiens 

<220> 

<221> CDS 

<222> (62) . . . (976) 



<400> 25 

gtcgacccac ggcgtccggc caggctccac tgaggggaac ggggacctgfc ctgaagagaa 60 
g atg ccc ctg ctg aca etc tac ctg etc etc ttc tgg etc tea ggc tac 109 
Met Pro Leu Leu Thr Leu Tyr Leu Leu Leu Phe Trp Leu Ser Gly Tyr 
1 5 10 15 



tec att gee act caa ate acc ggt cca aca aca gtg aat ggc ttg gag 157 
Ser lie Ala Thr Gin He Thr Gly Pro Thr Thr Val Asn Gly Leu Glu 
20 25 30 

egg. ggc tec ttg acc gtg cag tgt gtt tac aga tea ggc tgg gag acc 2 05 

Arg Gly Ser Leu Thr Val Gin Cys Val Tyr Arg Ser Gly Trp Glu Thr 
35 40 45 

tac ttg aag tgg tgg tgt cga gga get att tgg cgt gac tgc aag ate 253 
Tyr Leu Lys Trp Trp Cys Arg Gly Ala He Trp Arg Asp Cys Lys He 
50 55 60 

ctt gtt aaa acc agt ggg tea gag cag gag gtg aag agg gac egg gtg 3 01 

Leu Val Lys Thr Ser Gly Ser Glu Gin Glu Val Lys Arg Asp Arg Val 
65 70 75 80 

tec ate aag gac aat cag aaa aac cgc acg ttc act gtg acc atg gag 34 9 

Ser He Lys Asp Asn Gin Lys Asn Arg Thr Phe Thr Val Thr Met Glu 
85 90 95 

gat etc atg aaa act gat get gac act tac tgg tgt gga att gag aaa 3 97 

Asp Leu Met Lys Thr Asp Ala Asp Thr Tyr Trp Cys Gly He Glu Lys 
100 105 110 

act gga aat gac ctt ggg gtc aca gtt caa gtg acc att gac cca gcg 445 
Thr Gly Asn Asp Leu Gly Val Thr Val Gin Val Thr He Asp Pro Ala 
115 120 125 

teg act cct gee ccc acc acg cct act tec act acg ttt aca gca cca 493 
Ser Thr Pro Ala Pro Thr Thr Pro Thr Ser Thr Thr Phe Thr Ala Pro 
130 135 140 

gtc acc caa gaa gaa act age age tec cca act ctg acc ggc cac cac 541 
Val Thr Gin Glu Glu Thr Ser Ser Ser Pro Thr Leu Thr Gly His His 
145 150 155 160 



ttg gac aac agg cac aag etc ctg aag etc agt gtc etc ctg ccc etc 5 89 

Leu Asp Asn Arg His Lys Leu Leu Lys Leu Ser Val Leu Leu Pro Leu 
165 170 175 

ate ttc acc ata ttg ctg ctg ctt ttg gtg gee gec tea etc ttg get 63 7 

He Phe Thr He Leu Leu Leu Leu Leu Val Ala Ala Ser Leu Leu Ala 
180 185 190 

tgg agg atg atg aag tac cag cag aaa gca gec ggg atg tec cca gag 685 
Trp Arg Met Met Lys Tyr Gin Gin Lys Ala Ala Gly Met Ser Pro Glu 
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195 



200 



205 



cag gta ctg cag ccc ctg 
Gin Val Leu Gin Pro Leu 
210 

ctg cag ctg gcc gga acc 
Leu Gin Leu Ala Gly Thr 
225 230 

tct gcc cag gtt gac cag 
Ser Ala Gin Val Asp Gin 
245 

ttg ccg aag gag gac att 
Leu Pro Lys Glu Asp lie 
260 

gat cag gaa ccg acc tac 
Asp Gin Glu Pro Thr Tyr 
275 

ccc ggc agg ggc cct gag 
Pro Gly Arg Gly Pro Glu 
290 

cct tagcctgcac tccaggctcc ttcttggacc ccaggctgtg agcacactcc 102 6 

Pro 

305 

tgcctcatcg accgt.ctgcc ' ccctgctccc ctcatcagga ccaacccggg gactggtgcc 1086 
tctgcctgat cagccagcat tgcccctagc tctgggttgg gcttggggcc aagtctcagg 1146 
gggcttctag gagttggggt tttctaaacg tcccctcctc tcctacafcag ttgaggaggg 1206 
ggctagggat atgctctggg gctttcatgg gaatgatgaa gatgataafcg agaaaaatgt 1266 
tatcattatt atcatgaagt accattatca taatacaatg aacctttatt tattgcctac 1326 
cacatgttat gggctgaata atggccccca aagatatctg tgtcctaatc ctcagaactt 1386 
gtgactgtta ccttctgtgg cagaaaggga cagtgcagat gtatgtaagt taaggacttt 1446 
gagatagaga ggttattctt gctgattcag gtgggcccaa aatat caeca caagggtcct 1506 
cataagaaag aggecagaag gtcaaagagg tagagacaaa gtgatgatgg aagtggacgt 1566 
gggtgtgacg tgagcagggg ecatgaatge cgcagccttc agatgecaga aagggaaagg 1626 
aatggattcc cctgcctgga gcctccaaaa gaaaccagcc ctgcccacgc cttgacttga 1686 
geccattgaa actgatcttg agctcctggc ctccagaatt gcaggagaat aaatttgtgt 1746 
tgtttttaaa aaaaaaaaaa aaaaaaaagg gcggccgcta ga 1788 

<210> 26 
<211> 305 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Met Pro Leu Leu Thr Leu Tyr Leu Leu Leu Phe Trp Leu Ser Gly Tyr 

1 5 10 15 

Ser lie Ala Thr Gin lie Thr Gly Pro Thr Thr Val Asn Gly Leu Glu 

20 25 30 

Arg Gly Ser Leu Thr Val Gin Cys Val Tyr Arg Ser Gly Trp Glu Thr 

35 40; 45 

Tyr Leu Lys Trp Trp Cys Arg Gly Ala He Trp Arg Asp Cys Lys lie 
50 55 60 



gag ggc gac etc tgc tat gca gac ctg acc 733 
Glu Gly Asp Leu Cys Tyr Ala Asp Leu Thr 
215 220 

tec ccg cga aag get acc acg aag ctt tec 781 
Ser Pro Arg Lys Ala Thr Thr Lys Leu Ser 
235 240 

gtg gaa gtg gaa tat gtc acc atg get tec 829 
Val Glu Val Glu Tyr Val Thr Met Ala Ser 
250 255 

tec tat gca tct ctg acc ttg ggt get gag 877 
Ser Tyr Ala Ser Leu Thr Leu Gly Ala Glu 
265 270 

tgc aac atg ggc cac etc agt age cac etc 925 
Cys Asn Met Gly His Leu Ser Ser His Leu 
280 285 

gag ccc acg gaa tac age acc ate age agg 973 
Glu Pro Thr Glu Tyr Ser Thr lie Ser Arg 
295 300 
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Leu 


Val 


Lys 


Thr 


Ser 


Gly Ser 


Glu 












70 






Ser 


lie 


Lys 


Asp 


Asn 


Gin Lys 


Asn 










85 








Asp 


Leu 


Met 


Lys 


Thr 


Asp 


Ala 


Asp 








100 










Thr 


Gly 


Asn 


Asp 


Leu 


Gly Val 


Thr 






115 










120 


Ser 


Thr 


Pro 


Ala 


Pro 


Thr 


Thr 


Pro 




130 










135 




Val 


Thr 


Gin 


Glu 


Glu 


Thr 


Ser 


Ser 


145 










150 






Leu 


Asp 


Asn 


Arg His 


Lys 


Leu 


Leu 










165 








lie 


Phe 


Thr 


He 


Leu 


Leu 


Leu 


Leu 








180 










Trp 


Arg 


Met 


Met 


Lys 


Tyr 


Gin 


Gin 






195 










200 


Gin 


Val 


Leu 


Gin 


Pro 


Leu 


Glu 


Gly 




210 










215 




Leu 


Gin 


Leu 


Ala Gly 


Thr 


Ser 


Pro 


225 










230 






Ser 


Ala 


Gin 


Val 


Asp 


Gin 


Val 


Glu 










245 








Leu 


Pro 


Lys 


Glu 


Asp 


He 


Ser 


Tyr 








260 










Asp 


Gin 


Glu 


Pro 


Thr 


Tyr 


Cys 


Asn 






275 










280 


Pro 


Gly 


Arg 


Gly Pro 


Glu 


Glu 


Pro 




290 










295 





Pro 
305 



Gin 


Glu 


Val 


Lys 


Arg 


Asp 


Arg 


Val 






75 










80 


Arg 


Thr 


Phe 


Thr 


Val 


Thr 


Met 


Glu 




90 










95 




Thr 


Tyr 


Trp 


Cys 


Gly 


He 


Glu 


Lys 


105 










110 






Val 


Gin 


Val 


Thr 


He 


Asp 


Pro 


Ala 










125 








Thr 


Ser 


Thr 


Thr 


Phe 


Thr 


Ala 


Pro 








140 










Ser 


Pro 


Thr 


Leu 


Thr 


Gly 


His 


His 






155 










160 


Lys 


Leu 


Ser 


Val 


Leu 


Leu 


Pro 


Leu 




170 










175 




Leu 


Val 


Ala 


Ala 


Ser 


Leu 


Leu 


Ala 


185 










190 






Lys 


Ala 


Ala Gly 


Met 


Ser 


Pro 


Glu 










205 








Asp 


Leu 


Cys 


Tyr 


Ala 


Asp 


Leu 


Thr 








220 










Arg 


Lys 


Ala 


Thr 


Thr 


Lys 


Leu 


Ser 






235 










240 


Val 


Glu 


Tyr Val 


Thr 


Met 


Ala 


Ser 




250 










255 




Ala 


Ser 


Leu 


Thr 


Leu 


Gly 


Ala 


Glu 


265 










270 






Met 


Gly 


His 


Leu 


Ser 


Ser 


His 


Leu 










285 








Thr 


Glu 


Tyr 


Ser 


Thr 


lie 


Ser 


Arg 



300 



<210> 27 

<211> 915 

<212> DNA 

<213> Homo sapiens 

<400> 27 

atgcccctgc tgacactcta cctgctcctc ttctggctct caggctactc cattgccact 60 

caaatcaccg gtccaacaac agtgaatggc ttggagcggg gctccttgac cgtgcagtgt 120 

gtttacagat caggctggga gacctacttg aagtggtggt gtcgaggagc tatttggcgt 180 

gactgcaaga tccttgttaa aaccagtggg tcagagcagg aggtgaagag ggaccgggtg 24 0 

tccatcaagg acaatcagaa aaaccgcacg ttcactgtga ccatggagga tctcatgaaa 300 

actgatgctg acacttactg gtgtggaatt gagaaaactg gaaatgacct tggggtcaca 360 

gttcaagtga ccattgaccc agcgtcgact cctgccccca ccacgcctac ttccactacg 420 

tttacagcac cagtcaccca agaagaaact agcagctccc caactctgac cggccaccac 480 

ttggacaaca ggcacaagct cctgaagctc agtgtcctcc tgcccctcat cttcaccata 540 

ttgctgctgc ttttggtggc cgcctcactc ttggcttgga ggatgatgaa gtaccagcag 60 0 

aaagcagccg ggatgtcccc agagcaggta ctgcagcccc tggagggcga cctctgctat 660 

gcagacctga ccctgcagct ggccggaacc tccccgcgaa aggctaccac gaagctttcc 720 

tctgcccagg ttgaccaggt ggaagtggaa tatgtcacca tggcttcctt gccgaaggag 78 0 

gacatttcct atgcatctct gaccttgggt gctgaggatc aggaaccgac ctactgcaac 840 

atgggccacc tcagtagcca cctccccggc aggggccctg aggagcccac ggaatacagc 900 
accatcagca ggcct 915 

<210> 28 
<211> 3258 
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<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 

<222> (42) . . . (1625) 
<400> 28 

cacgcgtccg gccagttctt ggaggagact ctgcacaggg c atg gat cac tgt ggt 56 

Met Asp His cys Gly 
1 " 5 

gcc ctt ttc ctg tgc ctg tgc ctt ctg act ttg cag aat gca aca aca 104 
Ala Leu Phe Leu Cys Leu Cys Leu Leu Thr Leu Gin Asn Ala Thr Thr 
10 15 20 

gag aca tgg gaa gaa etc ctg age tac atg gag aat atg cag gtg tec 152 
Glu Thr Trp Glu Glu Leu Leu Ser Tyr Met Glu Asn Met Gin Val Ser 
25 30 35 

agg ggc egg age tea gtt ttt tec tct cgt caa etc cac cag ctg gag 200 
Arg Gly Arg Ser Ser Val Phe Ser Ser Arg Gin Leu His Gin Leu Glu 
40 45 50 

cag atg eta ctg aac acc age ttc cca ggc tac aac ctg acc ttg cag 248 
Gin Met Leu Leu Asn Thr Ser Phe Pro Gly Tyr Asn Leu Thr Leu Gin 
55 60 65 

aca ccc acc ate cag tct ctg gcc ttc aag ctg age tgt gac ttc tct 296 
Thr Pro Thr lie Gin Ser Leu Ala Phe Lys Leu Ser Cys Asp Phe Ser 
70 75 80 85 

ggc etc teg ctg acc agt gcc act ctg aag egg gtg ccc cag gca gga 344 
Gly Leu Ser Leu Thr Ser Ala Thr Leu Lys Arg Val Pro Gin Ala Gly 
90 95 100 

ggt cag cat gcc egg ggt cag cac gcc atg cag ttc ccc gcc gag ctg 3 92 

Gly Gin His Ala Arg Gly Gin His Ala Met Gin Phe Pro Ala Glu Leu 
105 110 115 

acc egg gac gcc tgc aag acc cgc ccc agg gag ctg egg etc ate tgt 440 
Thr Arg Asp Ala Cys Lys Thr Arg Pro Arg Glu Leu Arg Leu He Cys 
120 125 130 

ate tac ttc tec aac acc cac ttt ttc aag gat gaa aac aac tea tct 488 
He Tyr phe Ser Asn Thr His Phe Phe Lys Asp Glu Asn Asn Ser Ser 
135 140 145 

ctg ctg aat aac tac gtc ctg ggg gcc cag ctg agt cat ggg cac gtg 536 
Leu Leu Asn Asn Tyr Val Leu Gly Ala Gin Leu Ser His Gly His Val 
150 155 160 165 

aac aac -etc agg gat cct gtg aac ate age ttc tgg cac aac caa age 584 
Asn Asn Leu Arg Asp Pro Val Asn He Ser Phe Trp His Asn Gin Ser 
170 175 18 0 

ctg gaa ggc tac acc ctg acc tgt gtc ttc tgg aag gag gga gcc agg 632 
Leu Glu Gly Tyr Thr Leu Thr Cys Val Phe Trp Lys Glu Gly Ala Arg 
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185 190 195 

aaa cag ccc tgg ggg ggc tgg age cct.gag ggc tgt cgt aca gag cag 680 
Lys Gin Pro Trp Gly Gly Trp Ser Pro Glu Gly Cys Arg Thr Glu Gin 
200 205 210 

ccc tec cac tct cag gtg etc tgc cgc tgc aac cac etc ace tac ttt 72 8 

Pro Ser His Ser Gin Val Leu Cys Arg Cys Asn His Leu Thr Tyr Phe 
215 220 225 

get gtt etc atg caa etc tec cca gee ctg gtc cct gca gag ttg ctg 776 
Ala Val Leu Met Gin Leu Ser Pro Ala Leu Val Pro Ala Glu Leu Leu 
230 235 240 245 

gca cct ctt acg tac ate tec etc gtg ggc tgc age ate tec ate gtg 824 
Ala Pro Leu Thr Tyr lie Ser Leu Val Gly Cys Ser He Ser He Val 
250 255 260 

gee teg ctg ate aca gtc ctg ctg cac ttc cat ttc agg aag cag agt 872 
Ala Ser Leu He Thr Val Leu Leu His Phe His Phe Arg Lys Gin Ser 
265 270 275 

gac tee tta aca cgc ate cac atg aac ctg cat gee tec gtg ctg etc 920 
Asp Ser Leu Thr Arg He His Met Asn Leu His Ala Ser Val Leu Leu 
280 285 290 

ctg aac ate gee ttc ctg ctg age ccc gca ttc gca atg tct cet gtg 968 
Leu Asn He Ala Phe Leu Leu Ser Pro Ala Phe Ala Met Ser Pro Val 
295 300 305 

cce ggg tea gca tgc acg get ctg gec get gee ctg cac tac gcg ctg 1016 
Pro Gly Ser Ala Cys Thr Ala Leu Ala Ala Ala Leu His Tyr Ala Leu 
310 315 320 325 

etc age tgc etc ace tgg atg gee ate gag ggc ttc aac etc tac etc 1064 
Leu Ser Cys Leu Thr Trp Met Ala He Glu Gly Phe Asn Leu Tyr Leu 
330 335 340 

etc etc ggg cgt gtc tac aac ate tac ate cgc aga tat gtg ttc aag 1112 
Leu Leu Gly Arg Val Tyr Asn lie Tyr lie Arg Arg Tyr Val Phe Lys 
345 350 355 

ctt ggt gtg eta ggc tgg ggg gec cca gec etc ctg gtg ctg ctt tec 1160 
Leu Gly Val Leu Gly Trp Gly Ala Pro Ala Leu Leu Val Leu Leu Ser 
360 365 370 

etc tct gtc aag age teg gta tac gga ccc tgc aca ate ccc gtc ttc 12 08 

Leu Ser Val Lys Ser Ser Val Tyr Gly Pro Cys Thr He Pro Val Phe 
375 380 385 

gac age tgg gag aat ggc aca ggc ttc cag aac atg tec ata tgc tgg 1256 
Asp Ser Trp Glu Asn Gly Thr Gly Phe Gin Asn Met Ser He Cys Trp 
390 395 400 405 

gtg egg age ccc gtg gtg cac agt gtc ctg gtc atg ggc tac ggc ggc 13 04 

Val Arg Ser Pro Val Val His Ser Val Leu Val Met Gly Tyr Gly Gly 
410 415 " 420 
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etc acg tec ctcttc aac ctg gtg gtg ctg gec tgg gcg ctg tgg acc 1352 
Leu Thr Ser Leu Phe Asn Leu Val Val Leu Ala Trp Ala Leu Trp Thr 
425 430 435 

ctg cgc agg ctg egg gag egg gcg gat gca cca agt gtc agg gec tgc 1400 
Leu Arg Arg Leu Arg Glu Arg Ala Asp Ala Pro Ser Val Arg Ala Cys 
440 445 450 

cat gac act gtc act gtg ctg ggc etc acc gtg ctg ctg gga acc acc 1448 
His Asp Thr Val Thr Val Leu Gly Leu Thr Val Leu Leu Gly Thr Thr 
455 460 465 

tgg gee ttg gec ttc ttt tct ttt ggc gtc ttc ctg ctg ccc cag ctg 1496 
Trp Ala Leu Ala Phe Phe Ser Phe Gly Val Phe Leu Leu Pro Gin Leu 
470 475 480 485 

ttc etc ttc acc ate tta aac teg etc tac ggt ttc ttc ctt ttc ctg 1544 
Phe Leu Phe Thr He Leu Asn Ser Leu Tyr Gly Phe Phe Leu Phe Leu 
490 495 500 

tgg ttc tgc tec cag egg tgc cgc tea gaa gca gag gee aag gca cag 1592 
Trp Phe Cys Ser Gin Arg Cys Arg Ser Glu Ala Glu Ala Lys Ala Gin 
505 510 515 

ata gag gee ttc age tec tec caa aca aca cag tagtceggge ctcctggcct 164 5 
He Glu Ala Phe Ser Ser Ser Gin Thr Thr Gin 
520 525 

ggaatcctca gcctctctgg ccgccagtag cctgaggcta cggctcctgc tagagagggt 1705 

ggcaggcctg ctgctggacc ccagaggcca ctgtgaccgc caaggggect tttccacttc 1765 

cacggcctct ccaggcactg aggggaaggc attgetctae ctctccctga cattttgetc 1825 

eggggcagat ccaaccttac ctggggcagc aaactttgtc ctggtacctg ggcccagctc 1885 

gecagggatg tgggcagagc accagcctgg gcatcaggaa gecaagttte aaggactgtc 1945 

tttgagtctg tctgtatgac cttgggcctg ccacttctca cagaccctag gtatccacag 2005 

ctgtgacatg ggggcaagcg gctttgtttc agcctaaccc aggagcttag taaaaattgc 2 065 

ataagaccag ggggaagagt gtcagegtgg ggtgggaatt cccgcggcct ccacctgctt 2125 

gctaggggca ggatctcatt caggctgccc tggaagcacc tgcttggccc tgccac.cttc 2185 

ctccagggga gggccagatg gcatcctggc ttggggcggg tgggacctac ccaggctctg 2245 

agactttact ggcctatgcc tgaggectet tttcctttaa ctccctaaat tatgatgact 2305 

ccaagtccaa gcccaccctt cccaaagatt gggaggttcc gccgttccca gaggctcctc 2365 

ctgcggtgct cccaagactt ccatagacca tctggaccag tagcccatcc cgcagttttc 2425 

ttgggggcag aggaaaaege ttctttctcc tccagctgaa tcagctggat : cccagtgtcc 2485 

tggetgtttg gtgattgggc aagattgaat ttgcccaggt aggegtgaga gtgtgggttt 2545 

taaattcgaa gctcaggcca tagtttcaga gaatcaccct taccccagac. cttcatgaga 2605 

cagtgetcat gaagccagtg cgtttcccag aacgaacact aggcggcacc gttggtccac 2665 

actcagaggc ccttggcgcc aagactgeat etagaatege tcaaacacct gtttgeagae 2725 

cccatgcacc agctggaggg geegtaactg caggactgeg cctactgagt gacccatttc 2785 

ctccaggagg aaaggcaaga cacgcttaca eggecatttg tctcttttcc caatgeggeg 2845 

gtgeacttte gctcttgggg gctgcacccc agacatagct ggcaccagag cagggtgetc 2 905 

aggtggtggg tgetcaggge cctgccccag gccactgggc cgttttgatg acctcgaagg 2965 

t'cacaggcag aaaataggag caggatttcc cctggggaaa agttctcctg ggacatcttc 3025 

tgctcttctg tacatttcta gatgeaaata actccttcac caggcagtga gtggcgtagg 3085 

ctctggagcc aggctgectg ggctccaatg ccagctctgc cacttgetag ctgtgagact 3145 

gtggacaaac cactcagcct ctgtgtgcct cagttttcct atttgtaaaa tagaggecat 32 05 

agtggtacct attttgaaga ctaagtaaaa gaattcaaat aaagagactt ggc 3258 

<210> 29 
<211> 528 
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<212> PRT 

<213 > Homo sapiens 



<400> 


29 










Met Asp His 


Cys 


Gly 


Ala 


Leu 


Phe 


1 




5 








Gin Asn Ala 


Thr 


Thr 


Glu 


Thr 


Trp 




20 










Asn Met Gin 


Val 


Ser 


Arg 


Gly 


Arg 


35 










40 


Leu His Gin 


Leu 


Glu 


Gin 


Met 


Leu 


50 








55 




Asn Leu Thr 


Leu 


Gin 


Thr 


Pro 


Thr 


65 






70 






Ser Cys Asp 


Phe 


Ser 


Gly 


Leu 


Ser 






85 








Val Pro Gin 


Ala 


Gly 


Glv 


Gin 


His 


Phe Pro Ala 


100 










Glu 


Leu 


Thr 


Arg 


Asn 


115 










120 


Leu Arg Leu 


tie 


Cys 


He 


Tyr 


Phe 


130 








135 




Glu Asn Asn 


Ser 


Ser 


Leu 


Leu 


Asn 


145 






150 






Ser His Gly 


His 


Val 


Asn 


Asn 


Leu 






165 








Trp His Asn 


Gin 


Ser 


Leu 


Glu 


Glv 




180 










Lys Glu Gly 


Ala 


Arg 


Lys 


Gin 


Pro 


195 










200 


Cys Arg Thr 


Glu 


Gin 


Pro 


Ser 


His 


210 








215 




His Leu Thr 


Tyr 


Phe 


Ala 


Val 


Leu 


225 






230 






Pro Ala Glu 


Leu 


Leu 


Ala 


Pro 


Leu 






245 








Ser lie Ser 


He 


Val 


Ala 


Ser 


Leu 




260 










Phe Arg Lys 


Gin 


Ser 


Asp 


Ser 


Leu 


275 










280 


Ala Ser Val 


Leu 


Leu 


Leu 


Asn 


He 


290 








295 




Ala Met Ser 


Pro 


Val 


Pro 


Gly 


Ser 


305 






310 






Leu His Tyr 


Ala 


Leu 


Leu 


Ser 


Cys 






325 








Phe Asn Leu 


Tyr 


Leu 


Leu 


Leu 


Gly 




340 










Arg Tyr Val 


Phe 


Lys 


Leu 


Gly 


Val 


355 










360 


Leu Val Leu 


Leu 


Ser 


Leu 


Ser 


Val 


370 








375 




Thr lie Pro 


Val 


Phe 


Asp 


Ser 


Trp 


385 






390 






Met Ser He 


Cys 


Trp 


Val 


Arg 


Ser 






405 








Met Gly Tyr 


Gly 


Gly 


Leu 


Thr 


Ser 



420 



Leu 


Cys Leu 


Cys 


Leu 


Leu 


Thr 


Leu 




10 








15 




Glu 


Glu Leu 


Leu 


Ser 


Tyr 


Met 


Glu 


25 








30 






Ser 


Ser Val 


Phe 


Ser 


Ser Arg 


Gin 








45 








Leu 


Asn Thr 


Ser 


Phe 


Pro 


Gly Tyr 






60 










lie 


Gin Ser 


Leu 


Ala 


Phe 


Lys 


Leu 




75 










80 


Leu 


Thr Ser 


Ala 


Thr 


Leu Lys 


Arg 




90 








95 




Ala Arg Gly 


Gin 


His 


Ala 


Met 


Gin 


105 








110 






Ala Cys Lys 


Thr 


Arg 


Pro Arg 


Glu 








125 








Ser Asn Thr 


His 


Phe 


Phe 


Lys 


Asp 






14 0 










Asn Tyr Val 


Leu 


Glv 


Ala 


Gin 


Leu 




155 










160 


Arg 


Asp Pro 


Val 


Asn 


He 


Ser 


Phe 




170 








175 




Tyr 


Thr Leu 


Thr 


Cys 


Val 


Phe 


Trp 


185 








190 






Trp Gly Gly 


Trp 


Ser 


Pro Glu Gly 








205 








Ser 


Gin Val 


Leu 


Cys 


Arg 


Cys 


Asn 






220 










Met 


Gin Leu 


Ser 


Pro 


Ala 


Leu 


Val 




235 










240 


Thr 


Tyr He 


Ser 


Leu 


Val Gly Cys 




250 








255 




lie 


Thr Val 


Leu 


Leu 


His 


Phe 


His 


265 








270 






Thr 


Arg He 


His 


Met 


Asn 


Leu 


His 








285 








Ala 


Phe Leu 


Leu 


Ser 


Pro 


Ala 


Phe 






300 










Ala 


Cys Thr 


Ala 


Leu 


Ala 


Ala 


Ala 




315 










320 


Leu 


Thr Trp 


Met 


Ala 


He 


Glu 


Gly 




330 








3 35 




Arg 


Val Tyr 


Asn 


He 


Tyr 


lie 


Arg 


345 








350 






Leu 


Gly Trp 


Gly 


Ala 


Pro 


Ala 


Leu 








365 








Lys 


Ser Ser 


Val 


Tyr 


Gly 


Pro 


Cys 






380 










Glu 


Asn Gly 


Thr 


Gly 


Phe 


Gin 


Asn 




395 










400 


Pro 


Val Val 


His 


Ser 


Val 


Leu 


Val 




410 








415 




Leu 


Phe Asn 


Leu 


Val 


Val 


Leu 


Ala 


425 








430 







40 
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Trp 


Ala 


Leu 


Trp 


Thr 


Leu 


Arg Arg 






435 










440 


Ser 


Val 
450 


Arg 


Ala 


Cys 


His 


Asp 
455 


Thr 


Leu 


Leu 


Gly 


Thr 


Thr 


Trp 


Ala 


Leu 


465 










470 






Leu 


Leu 


Pro 


Gin 


Leu 
485 


Phe 


Leu 


Phe 


Phe 


Phe 


Leu 


Phe 
500 


Leu 


Trp 


Phe 


Cys 


Glu 


Ala 


kys 
515 


Ala 


Gin 


lie 


Glu 


Ala 
520 



Leu 


Arg 


Glu 


Arg 


Ala 


Asp 


Ala Pro 










445 






Val 


Thr 


Val 


Leu Gly 


Leu 


Thr Val 








460 








Ala 


Phe 


Phe 


Ser 


Phe 


Gly 


Val Phe 






475 








480 


Thr 


He 


Leu 


Asn 


Ser 


Leu 


Tyr Gly 




490 










495 


Ser 


Gin 


Arg 


Cys 


Arg 


Ser 


Glu Ala 


505 










510 




Phe 


Ser 


Ser 


Ser 


Gin 


Thr 


Thr Gin 



525 



<210> 30 

<211> 1584 

<212> DNA 

<213> Homo sapiens 



<400> 30 

atggatcact gtggtgccct tttcctgtgc ctgtgccttc tgactttgca gaatgcaaca 60 

acagagacat gggaagaact cctgagctac atggagaata tgcaggtgtc caggggccgg 120 

agctcagttt tttcctctcg tcaactccac cagctggagc agatgctact gaacaccagc 180 

ttcccaggct acaacctgac cttgcagaca cccaccatcc agtctctggc cttcaagctg 240 

agctgtgact tctctggcct ctcgctgacc agtgccactc tgaagcgggt gccccaggca 300 

ggaggtcagc atgcccgggg tcagcacgcc atgcagttcc ccgccgagct gacccgggac 360 

gcctgcaaga cccgccccag ggagctgcgg ctcatctgta tctacttctc caacacccac 42 0 

tttttcaagg atgaaaacaa ctcatctctg ctgaataact acgtcctggg ggcccagctg 480 

agtcatgggc acgtgaacaa cctcagggat cctgtgaaca tcagcttctg gcacaaccaa 540 

agcctggaag gctacaccct gadctgtgtc ttctggaagg agggagccag gaaacagccc 600 

^-9999999^ ggagccctga gggctgtcgt acagagcagc cctcccactc tcaggtgctc 660 

tgccgctgca accacctcac ctactttgct gttctcatgc aactctcccc agccctggtc 720 

cctgcagagt tgctggcacc tcttacgtac atctccctcg tgggctgcag catctccatc 780 

gtggcctcgc tgatcacagt cctgctgcac ttccatttca ggaagcagag tgactcctta 840 

acacgcatcc acatgaacct gcatgcctcc gtgctgctcc tgaacatcgc cttcctgctg 900 

agccccgcat tcgcaatgtc tcctgtgccc gggtcagcat gcacggctct ggccgctgcc 960 

ctgcactacg cgctgctcag cfcgcctoacc tggatggcca tcgagggctt caacctctac 1020 

ctcctcctcg ggcgtgtcta caacatctac atccgcagat atgtgttcaa gcttggtgtg 1080 

ctaggctggg gggccccagc cctcctggtg ctgctttccc tctctgtcaa gagctcggta 114 0 

tacggaccct gcacaatccc cgtcttcgac agctgggaga atggcacagg cttccagaac 1200 

atgtccatat gctgggtgcg gagccccgtg gtgcacagtg tcctggtcat gggctacggc 1260 

ggcctcacgt ccctcttcaa cctggtggtg ctggcctggg cgctgtggac cctgcgcagg 1320 

ctgcgggagc gggcggatgc accaagtgtc agggcctgcc atgacactgt cactgtgctg 13 80 

ggcctcaccg tgctgctggg aaccacctgg gccttggcct tcttttcttt tggcgtcttc 1440 

ctgctgcccc agctgttcct cttcaccatc ttaaactcgc tctacggttt cttccttttc 1500 

ctgtggttct gctcccagcg gtgccgctca gaagcagagg ccaaggcaca gatagaggcc 1560 

ttcagctcct cccaaacaac acag 1584 

<210> 31 

<211> 63 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Leu Lys Ser Pro Glu Gly Lys Ser Arg Lys Asn Pro Ala Arg Thr Cys 

1 5 10 15 

Lys Asp Leu Phe Leu Cys His Pro Glu Phe Lys Ser Gly Glu Tyr Trp 

20 25 30 

He Asp Pro Asn Gin Gly Cys He Lys Asp Ala He Lys Val Phe Cys 
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35 40 45 

Asn Lys Arg Phe Glu Thr Gly* Val Gly Glu Thr Cys lie Ser Pro 
50 -55 60 



<210> 32 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 32 

He Ser Asn Val Gin Thr Phe Leu Arg Leu Leu Ser Thr Glu Ala Ser 

1 5 10 15 

Gin Asn He Thr Tyr His Cys Lys Asn 
20 25 

<210> 33 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Thr Val Leu Gly Glu Asp Gly Cys Ser Ser Arg Thr Gly Glu Trp Gly 

15 10 15 

Lys Thr Val He Glu Tyr Glu Thr Lys Lys Thr Thr Arg Leu Pro He 
20 25 30 

Val 



<210> 34 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<400> 34 

He Asn Thr He Lys Asn Pro Leu Gly Thr Arg Asp Asn Pro Ala Arg 

1 5 10 15 ' 

He Cys Lys Asp Leu Leu Asn Cys Glu Gin Lys Val Ser Asp Gly Lys 

20 25 30 

Tyr Trp He Asp Pro Asn Leu Gly Cys Pro Ser Asp Ala lie Glu Val 

35 40 45 

Phe He Asn Thr Cys Asn Phe Ser Ala Gly Gly Gin Thr Cys Leu Pro 
50 55 60 

Pro 
65 

<210> 35 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 35 

Val Gly Lys Val Gin Met Asn Phe Leu His Leu Leu Ser Ser Glu Ala 

1 "5 10 15 

Thr His He He Thr He His Cys Leu Asn 
20 25 

<210> 36 
<211> 32 
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<212> PRT 

<213> Homo sapiens 



<400> 3<S 

Lys Val Leu Ser Asp Asp Cys Lys He Gin Asp Gly Ser Trp His Lys 

15 10 15 

Ala Thr Phe Leu Phe His Thr Gin Glu Pro Asn Gin Leu Pro Val He 
20 25 30 

<210> 37 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 37 

Gly Glu Ser Val Thr Leu Thr Cys Ser Val Ser Gly Phe Gly Pro Pro 

1 5 10 15 

Pro Val Thr Trp Leu Arg Asn Gly Lys Leu Ser Leu Thr He Ser 
20 25 30 

<210> 38 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 38 

Gly Arg Thr Val Arg Leu Gin Cys Pro Val Glu Gly Asp Pro Pro Pro 

15 10 15 

Thr Met Trp Thr Lys Asp Gly Arg Thr He His Ser Gly Trp Ser Arg 

20 25 30 

Phe Arg Val Leu Pro Gin Gly Leu Lys Val Lys Gin Val Glu Arg Glu 

35 40 45 

Asp Ala Gly Val Tyr Val Cys Lys Ala 
50 55 



<210> 


39 








<211> 


59 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


39 








Gly Ser Ser 


Val Arg Leu Lys Cys Val Ala Ser Gly His 


Pro 


Arg 


Pro 


1 


5 10 




15 




Asp He Thr 


Trp Met Lys Asp Asp Gin Ala Leu Thr Arg 


Pro 


Glu 


Ala 




20 25 


3 0 






Ala Glu Pro 


Arg Lys Lys Lys Trp Thr Leu Ser Leu Lys 


Asn 


Leu Arg 


35 


40 45 








Pro Glu Asp 


Ser Gly Lys Tyr Thr Cys Arg Val 








50 


55 








<210> 


40 








<211> 


79 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


40 








Gly Gly Thr 


Thr Ser Phe Gin Cys Lys Val Arg Ser Asp 


Val 


Lys 


Pro 


1 


5 10 




15 
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Val 


lie Gin 


Trp Leu Lys Arg 


Val Glu Tyr Gly Ala 


Glu Gly Arg His 






20 


25 


30 


Asn 


Ser Thr 


He Asp val Gly 


Gly Gin Lys Phe Val 


Val Leu Pro Thr 




35 




40 


45 


Gly 


Asp Val 


Trp Ser Arg Pro 


Asp Gly Ser Tyr Asn 


Lys Leu Leu He 




50 


55 


60 




Thr 


Arg Ala 


Arg Gin Asp Asp 


Ala Gly Met Tyr He 


Cys Leu Gly 


65 




70 


75 






<210> 


41 








<211> 


78 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


41 






Arg 


Gly Ser 


Leu Thr Val Gin 


Cys Val Tyr Arg Ser 


Gly Trp Glu Thr 


l 




5 


10 


15 


Tyr 


Leu Lys 


Trp Trp Cys Arg 


Gly Ala lie Trp Arg 


Asp Cys Lys He 






20 


25 


30 


Leu 


Val Lys 


Thr Ser Gly Ser 


Glu Gin Glu Val Lys 


Arg Asp Arg Val 




35 




40 


45 


Ser 


He Lys 


Asp Asn Gin Lys 


Asn Arg Thr Phe Thr 


Val Thr Met Glu 




50 


55 


60 




Asp 


Leu Met 


Lys Thr Asp Ala 


Asp Thr Tyr Trp Cys 


Gly lie 


65 




70 


75 






<210> 


42 








<211> 


10 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


42 






Val 


Phe Val 


Leu Gly Thr Leu 


Gly He Phe 




1 




5 


10 






<210> 


43 








<211> 


10 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


43 






Val 


Phe He 


Leu Gly Thr Leu 


Leu Leu Trp 




1 




5 


10 






<210> 


44 








<211> 


116 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


44 






Cys 


Gly Gly 


Thr Leu Asp Leu 


Thr Glu Ser Ser Gly 


Ser He Ser Ser 


1 




5 


10 


15 


Pro 


Asn Tyr 


Pro Asn Arg Ser 


Asp Tyr Pro Pro Asn 


Lys Glu Cys Val 






20 


25 


30 


Trp 


Arg He 


Arg Ala Pro Pro 


Gly Tyr Arg Val Val 


Glu Leu Thr Phe 




35 




40 


45 


Gin 


Asp Phe 


Asp Leu Glu Asp 


His Asp Gly Ala Pro 


Cys Arg Tyr Asp 




50 


55 


60 





44 
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Tvr* 


Val rtiii 


lie 


Arg Asp Gly Asp 


Pro 


Ser 


oer jr'ro j-ieu .Lieu biy Arg 


65 






70 






*7 5 fin 


Phe 




Ser 


Gly Lys Pro Glu 




lie 


/^.iy ser inr oer Asn Arg 








85 




90 


no 


Met 


Ttfti *t Tift 


Lys 


Phe Val Ser Asp 


Ala 


Ser 


vajL oer juys Arg oxy Fue 






100 




105 




*J 1 ft 


L»ys 


Ala J. HIT 


Tyr 












115 
i j- ~j 














<210> 


45 












sill? 


97 












<212> 


PRT 














Homo sapiens 










<400> 


45 










Gly 


Ser Val 


Leu 


Leu Ala Gin Glu 


Leu 


Pro 


Gin Gin Leu Thr Ser Pro 


1 






5 




10 


15 


Gly 


Tyr Pro 


Glu 


Pro Tyr Gly Lys 


Gly Gin 


Glu Ser Ser Thr Asp lie 






20 




25 




30 


Lys 


Ala Pro 


Glu 


Gly Phe Ala Val 


Arg 


Leu 


Val Phe Gin Asp Phe Asp 




35 




. 40 






45 


Leu 


Glu Pro 


Ser 


Gin Asp Cys Ala 


Gly Asp 


Ser Val Thr Val Ser Trp 




50 




55 






60 


Gly 


Trp Gly 


Gly 


Ser Arg Gin Asp 


Cys 


Gly 


Gin Gly Asp Ser Arg Gly 


65 






70 






75 80 


Cys 


Gly Lys 


Trp 


Arg Cys Pro Glu 


Ser 


Pro 


lie Trp Arg Arg Asp Glu 








85 




90 


95 



Phe 





<210> 


46 








<211> 


45 








<212> 


PRT 








<213> 


Homo sapiens 






<400> 


46 






Cys 


Ala Pro 


Asn 


Asn Pro Cys Ser Asn Gly 


Gly Thr Cys Val Asn Thr 


1 






5 10 


15 


Pro 


Gly Gly 


Ser 


Ser Asp Asn Phe Gly Gly 


Tyr Thr Cys Glu Cys Pro 






20 


25 


3 0 


Pro 


Gly Asp 


Tyr 


Tyr Leu Ser Tyr Thr Gly 


Lys Arg Cys 




35 




40 


45 




<210> 


47 








<211> 


67 








<212> 


PRT 








<213> 


Homo sapiens 






<400> 


47 






Trp 


Ser Thr 


Asp 


Lys His lie Gly Gly Arg 


Thr Ser Leu Gly Phe Asn 


1 






5 10 


15 


Leu 


Glu Tyr 


Arg 


lie Arg Val Thr Cys Asp 


Glu Asn Tyr Tyr Gly Glu 






20 


25 


3 0 


Gly 


Cys Asn 


Lys 


Phe Cys Arg Pro Arg Asp 


Asp Ala Phe Gly His Tyr 




35 




40 


45 


Thr 


Cys Asp 


Glu 


Asn Gly Asn Lys Leu Cys 


Leu Glu Gly Trp Lys Gly 




50 




55 


60 



Glu Tyr Gys 

45 
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65 





<210> 


48 








<211> 


59 








<212> 


PRT 








<213> 


jtiOuio sapiens 








<400> 


48 






Cys 


Asp Cys 


Asn Pro His Gly 


Ser Leu Ser Asp Asp Thr Cys 


Asp Ser 


1 




5 


10 


15 


Asp 


Asp Glu 


Leu Phe Gly Glu 


Glu Thr Gly Gin Cys Leu Lys 


Cys Lys 






20 


25 30 




Pro 


Asn Val 


Thr Gly Arg Arg 


Cys Asp Arg Cys Lys Pro Gly Tyr Tyr 




35 




40 45 




Gly Leu Pro 


Ser Gly Asp Pro 


Gin Gin Gly Cys 






50 


55 








<210> 


49 








<211> 


31 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


49 






Cys 


Val Pro 


Leu Cys Ala Gin 


Glu Cys Val His Gly . Arg Cys 


Val Ala 


1 




5 


10 


15 


Pro 


Asn Gin 


Cys Gin Cys Val 


Pro Gly Trp Arg Gly Asp Asp 


Cys 






20 


25 30 






<210> 


50 








<211> 


30 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


50 






Cys 


Gin Phe 


Arg Cys Gin Cys 


His Gly Ala Pro Cys Asp Pro Gin Thr 


1 




5 


10 


15 


Gly Ala Cys 


Phe Cys Pro Ala 


Glu Arg thr Gly Pro Ser Cys 








20 


25 30 






<210> 


51 








<211> 


31 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


51 






Cys 


Pro Ser 


Thr His Pro Cys 


Gin Asn Gly Gly Val Phe Gin 


Thr Pro 


1 




5 


10 


15 


Gin Gly Ser 


Cys Ser Cys Pro 


Pro Gly Trp Met Gly Thr lie 


Cys 






20 


25 30 






<210> 


52 








<211> 


31 








<212> 


PRT 








<213> 


Homo sapiens 








<400> 


52 








Ser Gin 


Glu Cys Arg Cys 


His Asn Gly Gly Leu Cys Asp Arg Phe 


1 




5 


10 


15 
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Thr Gly Gin 



Cys Arg Cys Ala Pro Gly Tyr Thr Gly Asp Arg Cys 
20 25 30 



<210> 53 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 53 

Cys Ala Glu Thr Cys Asp Cys Ala Pro Asp Ala Arg Cys Phe Pro Ala 

1 5 10 15 

Asn Gly Ala Cys Leu Cys Glu His Gly Phe Thr Gly Asp Arg Cys 
20 25 30 

<210> 54 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 54 

Cys Asp Arg Glu His ser Leu Ser Cys His Pro Met Asn Gly Glu Cys 

1 5 10 15 

Ser Cys Leu Pro Gly Trp Ala Gly Leu His Cys 
20 25 

<210> 55 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 55 

Cys Gin Glu His Cys Leu Cys Leu His Gly Gly Val Cys Gin Ala Thr 

1 5 10 15 

Ser Gly Leu Cys Gin Cys Ala Pro Gly Tyr Thr Gly Pro His Cys 
20 25 30 

<210> 56 
<211> 31 
<212> PRT 

<213> Homo sapiens 

<400> 56 ■ . 

Cys Ser Ala Arg Cys Ser Cys Glu Asn Ala He Ala Cys Ser Pro He 

1 5 10 ** 15 

Asp Gly Glu Cys Val Cys Lys Glu Gly Trp Gin Arg Gly Asn Cys 
20 25 30 

<210> 57 
<2ll> 31 
<212> PRT 

<213> Homo sapiens 
<400> 57 

Cys Asn Ala Ser Cys Gin Cys Ala His Glu Ala Val Cys Ser Pro Gin 

1 5 10 15 

Thr Gly Ala Cys Thr Cys Thr Pro Gly Trp His Gly Ala His Cys 
20 25 30 
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<210> 58 

<211> 31 

<212> PRT 

<213> Homo sapiens 



<400> 58 
Cys Ala Ser Arg Cys Asp Cys Asp 

1 5 
His Gly Arg Cys Gin Cys Gin Ala 
20 



His Ser Asp Gly Cys Asp Pro Val 

10 15 
Gly Trp Met Gly Ala Arg Cys 
25 30 



<210> 59 

<211> 31 

<212> PRT 

<213> Homo sapiens 



<400> 59 
Cys Ser Asn Thr Cys Thr Cys Lys 

1 5 
Asn Gly Asn Cys Val Cys Ala Pro 
20 



Asn Gly Gly Thr Cys Leu Pro Glu 

10 15 
Gly Phe Arg Gly Pro Ser Cys 
25 30 



<210> 60 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Cys Val Pro Cys Lys Cys Ala Asn His Ser Phe Cys His Pro Ser Asn 

1 S 10 15 

Gly Thr Cys Tyr Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys 
20 25 30 

<210> 61 ' 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 61 

Cys Ala Gin Thr Cys Gin Cys His His Gly Gly Thr Cys His Pro Gin 

1 5 10 15 . 

Asp Gly Ser Cys lie Cys Pro Leu Gly Trp Thr Gly His His Cys 
20 25 30 

<210> 62 

<211> 31 

<212> PRT 

<213> Homo sapiens 



<400> 62 

Cys Ser Gin Pro Cys Gin Cys Gly Pro Gly Glu Lys Cys His Pro Glu 

1 5 10 15 

Thr Gly Ala Cys Val Cys Pro Pro Gly His Ser Gly Ala Pro Cys 
20 25 30 



<210> 63 
<211> 37 
<212> PRT 
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<213> Homo sapiens 



<400> 63 

Gin Thr Gly Ala Cys Thr Cys Thr 

1 5 
Gin Leu Pro Cys Pro Lys Gly Gin 
20 

Cys Asp Cys Asp His 
35 



Pro Gly Trp His Gly Ala His Cys 

10 15 

Phe Gly Glu Gly Cys Ala Ser Arg 
25 30 



<210> 64 
<211> 31 
<212> PRT 

<213> Mus mug cuius 

<400> 64 
Cys Ser Asn Thr Cys Thr Cys Lys 

1 5 
Asn. Gly Asn Cys Val Cys Ala Pro 
20 

<210> 65 
<211> 31 
<212> PRT 

<213> Mus mus cuius 



Asn Gly Gly Thr Cys Val Ser Glu 

10 15 

Gly Phe Arg Gly Pro Ser Cys 
25 " 30 



<400> 65 

Cys Val Gin Cys Lys Cys Asn Asn Asn His Ser Ser Cys His Pro Ser 

1 5 10 15 

Asp Gly Thr Cys Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys 
20 25 30 



<210> 66 
<211> 31 
<212> PRT 

<213> Mus musculus 

<400> 66 
Cys Ser Gin Leu Cys Gin Cys His 

1 5 
Asp Gly Ser Cys lie Cys Thr Pro 
20 

<210> 67 

<211> 31 

<212> PRT 

<213> Mus musculus 



His Gly Gly Thr Cys His Pro Gin 

10 15 
Gly Trp Thr Gly Pro ABn Cys 
25 30 



<400> 67 
Cys Ser Gin Leu Cys Gin Cys Asp 

1 5 
Thr Gly Ala Cys Val Cys Pro Pro 
20 



Leu Gly Glu Met Cys His Pro Glu 

10 15 
Gly His Ser Gly Ala Asp Cys 
25 30 



<210> 68 

<211> 35 . 

<212> PRT 

<213> Mus musculus 
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<400> 68 

His Ala Ser Gly Asp Pro Val His Gly Gin Cys Arg Cys Gin Ala Gly 

1 5 10 15 

Trp Met Gly Thr Arg Cys His Leu Pro Cys Pro Glu Gly Phe Trp Gly 
20 25 30 

Ala Asn Cys 
35 

<210> 69 
<211> 40 
<212> PRT 

<213> Mus musculus 
<400> 69 

Cys Thr Cys Lys Asn Gly Gly Thr Cys Val Ser Glu Asn Gly Asn Cys 

1 5 10 15 

Val Cys Ala Pro Gly Phe Arg Gly Pro Ser Cys Gin Arg Pro Cys Pro 

20 25 30 

Pro Gly Arg Tyr Gly Lys Arg Cys 
35 40 

<210> 70 

<211> 35 

<212> PRT 

<213> Mus musculus 

<400> 70 

Cys Lys Cys Asn Asn Asn His Ser Ser Cys His Pro Ser Asp Gly Thr 

15 10 15 

Cys Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys Ser Glu Ala Cys 
20 25 30 

Pro Pro Gly 
35 

<210> 71 
<211> 34 
<212> PRT 

<213> Mus musculus 
<400> 71 

Cys Gin Cys His His Gly Gly Thr Cys His Pro Gin Asp Gly Ser Cys 

1 5 10 15 

lie Cys Thr Pro Gly Trp Thr Gly Pro Asn Cys Leu Glu Gly Cys Pro 
20 25 30 

Pro Arg 



<210> 72 
<211> 58 
<212> PRT 

<213> Mus musculus 
<400> 72 

His Gly Gin Cys Arg Cys Gin Ala Gly Trp Met Gly Thr Arg Cys His 

1 5 10 15 

Leu Pro Cys Pro Glu Gly Phe Trp Gly Ala Asn Cys Ser Asn Thr Cys 

20 ~ 25 "* 30 

Thr Cys Lys Asn Gly Gly Thr Cys Val Ser Glu Asn Gly Asn Cys Val 



50 
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35 40 45 

Cys Ala Pro Gly Phe Arg Gly Pro Ser Cys 
50 55 

<210> 73 

<211> 28 

<212> PRT 

<213> Rattus sp, 

<400> 73 

Glu Cys Arg Cys His Asn Gly Gly Leu Cys Asp Arg Phe Thr Gly Gin 

1 5 10 15 

Cys His Cys Ala Pro Gly Tyr He Gly Asp Arg Cys 
20 25 

<210> 74 

<211> 31 

<212> PRT 

<213> RattUS sp. 

<400> 74 

Cys Ala Glu Thr Cys Asp Cys Ala Pro Gly Ala Arg Cys Phe Pro Ala 

1 5 10 15 

Asn Gly Ala Cys Leu Cys Glu His Gly Phe Thr Gly Asp Arg Cys 
20 25 30 

<210> 75 

<211> 33 

<212> PRT 

<213> Rattus sp. 

<400> 75 

Cys Gin Asp Pro Cys Thr Cys Asp Pro Glu His Ser Leu Ser Cys His 

1 5 10 15 

Pro Met His Gly Glu Cys Ser Cys Gin Pro Gly Trp Ala Gly Leu His 
20 25 30 

Cys 



<210> 76 

<211> 31 

<212> PRT 

<213> Rattus sp. 

<400> 76 

Cys Gin Glu His Cys Leu Cys Leu His Gly Gly Val Cys Leu Ala Asp 

1 5 10 15 

Ser Gly Leu Cys Arg Cys Ala Pro Gly Tyr Thr Gly Pro His Cys 
20 25 30 

<210> 77 

<211> 31 

<212> PRT 

<213> Rattus sp. 

<400> 77 

Cys Ser Ser His Cys Ser Cys Glu Asn Ala He Ala Cys Ser Pro Val 
1.5 10 15 
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Asp Gly Thr Cys lie Cys Lys Glu Gly Trp Gin Arg Gly Asn Cys 
20 25 30 

<210> 78 

<211> 31 

<212> PRT 

<213> Rattus sp. 

<400> 78 

Cys Asn Ala Ser Cys Gin Cys Ala His Glu Gly Val Cys Ser Pro Gin 

15 10 15 

Thr Gly Ala Cys Thr Cys Thr Pro Gly Trp Arg Gly Val His Cys 
20 25 30 

<210> 79 

<211> 31 

<212> PRT 

<213> Rattus sp* 

<400> 79 

Cys Ala Ser Val Cys Asp Cys Asp His Ser Asp Gly Cys Asp Pro Val 

1 5 10 " 15 

His Gly His Cys Arg Cys Gin Ala Gly Trp Met Gly Thr Arg Cys 
20 25 30 

<210> 80 

<211> 31 

<212> PRT 

<213> Rattus sp. 

<400> 80 

Cys Ser Asn Ala Cys Thr Cys Lys Asn Gly Gly Thr Cys Val Pro Glu 

1 5 10 15 

Asn Gly Asn Cys Val Cys Ala Pro Gly Phe Arg Gly Pro Ser Cys 
20 25 30 

<210> 81 

<211> 30 

<212> PRT 

<213> Rattus sp, 

<400> 81 

Cys Val Pro Cys Lys Cys Asn Asn His Ser Ser Cys His Pro Ser Asp 

1 5 10 15 

Gly Thr Cys Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys 
20 25 30 

<210> 82 

<211> 31 

<212> PRT 

<213> Rattus sp. 

<400> 82 

Cys Ser Gin Pro Cys Gin Cys His His Gly Ala Thr Cys His Pro Gin 

15 10 15 

Asp Gly Ser Cys Val Cys lie Pro Gly Trp Thr Gly Pro Asn Cys 
20 25 30 
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<210> 83 

<211> 31 

<212> PRT 

<213> Rattus sp. 

<400> 83 

Cys Ser Gin Leu Cys Gin Cys Asp Pro Gly Glu Met Cys His Pro Glu 

1 5 10 15 

Thr Gly Ala Cys Val Cys Pro Pro Gly His Ser Gly Ala His Cys 
20 25 30 

<210> 84 

<211> 40 

<212> PRT 

<213> Rattus sp. 

<400> 84 

Cys Arg Cys His Asn Gly Gly Leu Cys Asp Arg Phe Thr Gly Gin Cys 

1 5 10 15 

His Cys Ala Pro Gly Tyr lie Gly Asp Arg Cys Arg Glu Glu Cys Pro 

20 25 30 

Val Gly Arg Phe Gly Gin Asp Cys 
35 40 

<210> 85 

<211> 39 

<212> PRT 

<213> Rattus sp. 

<400> 85 

Cys Asp Cys Ala Pro Gly Ala Arg Cys Phe Pro Ala Asn Gly Ala Cys 

1 5 10 15 

Leu Cys Glu His Gly Phe Thr Gly Asp Arg Cys Thr Glu Arg Leu Cys 

20 25 30 

Pro Asp Gly Tyr Gly Leu Cys 
35 

<210> 86 

<211> 42 

<212> PRT 

<213> Rattus sp. 

<400> 86 

Cys Thr Cys Asp Pro Glu His Ser Leu Ser Cys His Pro Met His Gly 

1 5 10 15 

Glu Cys Ser Cys Gin Pro Gly Trp Ala Gly Leu His Cys Asn Glu Ser 

20 25 30 

Cys Pro Gin Asp Thr His Gly Ala Gly Cys 
35 40 

<210> 87 

<211> 40 

<212> PRT 

<213> Rattus sp. 

<400> 87 

Cys Leu Cys Leu His Gly Gly Val Cys Leu Ala Asp Ser Gly Leu Cys 
1 5 10 15 
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Arg 


Cys Ala 


Pro Gly Tyr Thr Gly Pro His Cys Ala Asn Leu Cys Pro 






20 25 30 


Pro 


Asn Thr 


Tyr Gly lie Asn Cys 




35 


40 




<210> 


88 




<211> 


40 




<212> 


PRT 




<213> 


Rattus sp . 




<400> 


88 


Cvs 


Ser Cys 


Glu Asn Ala He Ala Cys Ser Pro Val Asp Gly Thr Cys 


1 




5 10 15 


He 


Cvs Lvs 


Glu Gly Trp Gin Arg Gly Asn Cys Ser Val Pro Cys Pro 






20 25 30 


Pro 


Gly Thr 


Trp Gly Phe Ser Cys 




35 


40 




<210> 


89 




<211> 


40 




<212> 


PRT 




<213> 


Rattus sp. 




<40O> 


89 


Cys 


Gin Cys 


Ala s Glu Glv Val Cvs Ser Pro Gin Thr Glv Ala Cvs 


1 




5 10 15 


Thr 


Cvs Thr 


Pro Glv TrD Arcr Glv Val His Cvs Gin Leu Pro Cvs Pro 






20 25 30 


Lys 


Gly Gin 


Phe Gly Glu Gly Cys 




35 


40 




<210> 


90 




<211> 


40 




<212> 


PRT 




<213> 


Rattus sp. 




<400> 


90 


Cys 


Asp Cys 


Asp His Ser Asp Gly Cys Asp Pro Val His Gly His Cys 


1 




5 10 15 


Arcr 


Cys Gin 


Ala Gly Trp Met Gly Thr Arg Cys His Leu Pro Cys Pro 






20 25 30 


Glu 


Gly Phe 


Trp Gly Ala Asn Cys 




35 


40 




<210> 


91 




<211> 


40 




<212> 


PRT 




<213> 


Rattus sp . 




<400> 


91 


Cys 


Thr Cys 


Lys Asn Gly Gly Thr Cys Val Pro Glu Asn Gly Asn Cys 


1 




5 10 15 


Val 


Cys Ala 


Pro Gly Phe Arg Gly Pro Ser Cys Gin Arg Pro Cys Pro 






20 ^ 25 30 


Pro 


Gly Arg 


Tyr Gly Lys Arg Cys 




35 


40 




<210> 


92 



54 



BNSDOCID: <WO 0100673A1JB> 



<2ll> 40 

<212> PRT 

<213> Rattus sp. 

<400> 92 

Cys Lys Cys Asn Asn His Ser Ser Cys His Pro Ser Asp Gly Thr Cys 

1.5 10 15 

Ser Cys Leu Ala Gly Trp Thr Gly Pro Asp Cys Ser Glu Ser Cys Pro 

20 25 30 

Pro Gly His Trp Gly Leu Lys Cys 
35 40 

<210> 93 

<211> 40 

<212> PRT 

<213> Rattus sp. 

<400> 93 

Cys Gin cys His His Gly Ala Thr Cys His Pro Gin Asp Gly Ser Cys 

15 10 15 

Val Cys lie Pro Gly Trp Thr Gly Pro Asn Cys Ser Glu Gly Cys Pro 

20 25 30 

Ser Arg Met Phe Gly Val Asn Cys 
35 40 

<210> 94 

<211> 36 

<212> PRT 

<213> Rattus sp. 

<400> 94 
Cys Gin Cys Asp 
1 

Val Cys Pro Pro 
20 

Glu Ser Phe Thr 
35 

<210> 95 
<211> 64 
<212> PRT 
<213> Rattus sp. 

<400> 95 
Gly Val Cys Ser Pro 

1 5 
Arg Gly Val His Cys 
20 

Gly Cys Ala Ser Val 
35 

Val His Gly His Cys 
50 

<210> 96 
<211> 129 
<212> PRT 

<213> Homo sapiens 
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Pro Gly Glu Met Cys His Pro Glu Thr Gly Ala Cys 

5 10 15 

Gly His Ser Gly Ala His Cys Lys Val Gly Ser Gin 
25 30 



Gin Thr Gly Ala Cys Thr Cys Thr Pro Gly Trp 

10 15 
Gin Leu Pro Cys Pro Lys Gly Gin Phe Gly Glu 

25 30 
Cys Asp Cys Asp His Ser Asp Gly Cys Asp Pro 

40 45 
Arg Cys Gin Ala Gly Trp Met Gly Thr Arg Cys 
55 60 
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<400> 96 



Gin 


Glu 


Ser 


Arg 


Ala Gin Lys 


Phe 


Leu 


Arg 


Gin His He 


Asp Ser 


Pro 


1 








5 






10 




15 




Lys 


Thr 


Ser 


Ser 


Ser Asn Pro 


Asn 


Tyr 


Cys 


Asn Gin Met 


Met Asp 


Lys 








20 






25 






30 




Arg 


Arg 


Asn 


Met 


Thr Gin Gin 


Arg 


Cys 


Lys 


Pro Val Asn 


Thr Phe 


Val 






35 






40 






45 






His 


Glu 


Ser 


Leu Ala Asp Val 


Lys 


Ala 


Val 


Cys Ser Gin 


Lys Asn 


Val 




50 






55 








60 






Thr 


cys 


Lys 


Asn 


Gly Gin Ser 


Lys 


Ser 


Ser 


pne Gj.n ixe 


Thr Asp 


Cys 


65 








70 








75 




80 


Arg 


Leu 


Thr 


Gly Gly Ser Gin 


Lys 


Tyr 


Pro 


Asn Cys Arg 


Tyr Arg 


Thr 










85 






90 




95 




Ser 


Ala 


Ser 


Thr 
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Cys 


Ala 


Asn 


Leu 


Cys 








180 










Cys 


Ser 


Ser 


His 


Cys 


Ser 


Cys 


Glu 






195 










200 


Asp 


Gly 


Thr 


Cys 


lie 


Cys 


Lys 


Glu 




210 










215 




Val 


Pro 


Cys 


Pro 


Pro 


Gly 


Thr 


Trp 


225 










230 






Gin 


Cys 


Ala 


His 


Glu 


Gly 


Val 


Cys 










245 








Cys 


Thr 


Pro 


Gly 


Trp 


Arg 


Gly 


Val 








260 










Gly 


Gin 


Phe 


Gly 


Glu 


Gly 


Cys 


Ala 






275 










280 


Asp 


Gly 


Cys 


Asp 


Pro 


Val 


His 


Gly 



Ala 


Val 


Ala 


Tyr 


Ser 


Thr 


Gly 


Arg 








540 










Pro 


Asp 


Val 


Ser 


Pro 


Ser 


Tyr 


Ser 






555 










560 


His 


Thr 


Leu 


Ser 


Gin 


Cys 


Ser 


Pro 




570 










575 




Pro 


Gly 


Ser 


Gin 


Leu 


Phe 


Val 


Ser 


585 










590 






Arg 


Asn 


His 


Gly 


■"•X y 


Asp 


Asn 


His 










605 








His 


A TCI 




Glu 


Ser 


His 


Asp 










62 0 










Gly 


Pro 


Lys 


Val 














635 












Cys 


Jr X <J 


Glu 


vaxy 


PVia 
iriic 


xix a 


Gly 


IrX V 




1U 










XD 




His 


Asn 


Gly 


Gly 


Leu 


Cys 


Asp 


Airg 


A D 










J V 






Pro 


Glv 


TVF 


He 


Gly 


Asp 


Arg 


Cys 


















Phe 


Gly 


Gln 


Asp 


Cys 


Ala 


Glu 


Thr 








ft 
o u 










Cys 




Pro 




Asn 


v?xy 




cys 






/ d 










ft A 


Asp 


Arg 


Cys 


Thr 




Arg 


T 

Leu 


Cys 




y u 














cys 


vJXll 




JrX w 


cys 


Tnr 


Cys 


ASp 


xur> 










1 1 A 

xxu 






Pro 




His 




VjX u. 


y » 


OCl 




















Cys 


Asn 


UXu 


Ser 


Cys 


Pro 




Asp 








Xfr w 










His 


Cys 


Leu 


Cys 


Leu 


His 


Gly 


Gly 






IDS 










X D U 




r*ju y 


Cys 


Ala 


trX W 


Glv 


xyx 


Thr 




X / U 










17c 
X / o 




Pro 


Pro 


Asn 


Thr 


Tvr 


Glv 


lie 


Asn 


185 










190 






Asn 


Ala 


lie 


Ala 


Cys 


Ser 


Pro 


Val 










205 








Gly 


Trp 


Gin 


Arg 


Gly 


Asn 


Cys 


Ser 








220 










Gly 


Phe 


Ser 


Cys 


Asn 


Ala 


Ser 


Cys 






235 










240 


Ser 


Pro 


Gin 


Thr 


Gly 


Ala 


Cys 


Thr 




250 










255 




His 


Cys 


Gin 


Leu 


Pro 


Cys 


Pro 


Lys 


265 










270 






Ser 


Val 


Cys 


Asp 


Cys 


Asp 


His 


Ser 










285 








His 


Cys 


Arg 


Cys 


Gin 


Ala 


Gly 


Trp 
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290 








295 


300 




Met 


Gly 


Thr 


Arg 


Cys 


His Leu Pro Cys 


Pro Glu Gly Phe Trp 


Gly Ala 


305 










310 


315 


320 


Asn 


Cys 


Ser 


Asn 


Ala 


Cys Thr Cys Lys 


Asn Gly Gly Thr Cys 


Val Pro 










325 




330 


335 


Glu 


Asn 


Gly 


Asn 


Cys 


Val Cys Ala Pro 


Gly Phe Arg Gly Pro 


Ser Cys 








340 




345 


350 




Gin 


Arg 


Pro 


Cys 


Pro Pro Gly Arg Tyr 


Gly Lys Arg Cys Val 


Pro Cys 






355 






360 


365 




Lys 


Cys 


Asn 


Asn 


His 


Ser Ser Cys His 


Pro Ser Asp Gly Thr 


Cys Ser 




370 








375 


380 




Cys 


Leu 


Ala 


Gly 


Trp 


Thr Gly Pro Asp 


Cys Ser Glu Ser Cys 


Pro Pro 


385 










390 


395 


400 


Gly 


His 


Trp 


Gly 


Leu 


Lys Cys Ser Gin 


Pro Cys Gin Cys His 


His Gly 










405 




410 


415 


Ala 


Thr 


Cys 


His 


Pro 


Gin Asp Gly Ser 


Cys Val Cys lie Pro 


Gly Trp 








420 




425 


430 




Thr 


Gly 


Pro 


Asn 


Cys 


Ser Glu Gly Cys 


Pro Ser Arg Met Phe 


Gly Val 






435 






440 


445 




Asn 


Cys 


Ser 


Gin 


Leu 


Cys Gin Cys Asp 


Pro Gly Glu Met Cys 


His Pro 




450 








455 


460 




Glu 


Thr 


Gly 


Ala 


Cys 


Val Cys Pro Pro 


Gly His Ser Gly Ala 


His Cys 


465 










470 


475 


480 


Lys 


Val 


Gly 


Ser 


Gin 


Glu Ser Phe Thr 


lie Met Pro Thr Ser 


Pro Val 










485 




490 


4 95 


lie 


His 


Asn 


Ser 











500 





<210> 


123 








<211> 


24 








<212> 


PRT 








<213> 


Rattus sp . 








<400> 


123 






Leu 


Gly Ala 


Val lie Gly He Ala Val 


Leu Gly Thr Leu Val 


Val Ala 


1 




5 


10 


15 


Leu 


Val Ala 


Leu Phe He Gly Tyr 










20 








<210> 


124 








<2ll> 


112 








<212> 


PRT 








<213> 


Rattus sp. 








<400> 


124 






Arg 


His Trp 


Gin Lys Gly Lys Glu His 


Glu His Leu Ala Val 


Ala Tyr 


1 




5 


10 


15 


Ser 


Thr Gly 


Arg Leu Asp Gly Ser Asp 


Tyr Val Met Pro Asp 


Val Ser 






20 25 


30 




Pro 


Ser Tyr 


Ser His Tyr Tyr Ser Asn 


Pro Ser Tyr His Thr 


Leu Ser 




35 


40 


45 




Gin 


Cys Ser 


Pro Asn Pro Pro Pro Pro 


Asn Lys He Pro Gly 


Ser Gin 




50 


55 


60 




Leu 


Phe Val 


Ser Ser Gin Ala Ser Glu 


Arg Pro Asn Arg Asn 


His Gly 


65 




70 


75 


80 


Arg 


Asp Asn 


His Ala Thr Leu Pro Ala 


Asp Trp Lys His Arg 


Arg Glu 






85 


90 


95 


Ser 


His Asp 


Arg Ala Phe Leu Arg His 


Gin Pro Pro Gly Pro 


Lys Val 



74 



100 



105 



110 



<210> 125 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 125 

Met Ala Pro Ala Arg Ala C31y phe Cys Pro Leu Leu Leu Leu Leu Leu 

15 10 is 

Leu Gly Leu Trp Val Ala Glu He Pro Val Ser Ala 
20 25 

<210> 126 

<211> 128 

<212> PRT 

<213> Homo sapiens 



<400> 


126 


























Lys Pro 


Lys 


Gly 


Met 


Thr 


Ser Ser Gin Trp Phe 


Lys 


He 


Gin 


His 


Met 


1 






5 










10 










15 




Gin Pro 


Ser 


Pro 


Gin 


Ala 


Cys 


Asn 


Ser 


Ala 


Met 


Lys 


Asn 


He 


Asn 


Lys 


His Thr 




20 










25 










30 




Lys 


Arg 


Cys 


Lys 


Asp 


Leu 


Asn 


Thr 


Phe 


Leu 


His 


Glu 


Pro 


Phe 




35 










40 










45 








Ser Ser 


Val 


Ala 


Ala 


Thr 


Cys 


Gin 


Thr 


Pro 


Lys 


He 


Ala 


Cys 


Lys 


Asn 


50 










55 










60 






Gly Asp 


Lys 


Asn 


Cys 


His 


Gin 


Ser 


His 


Gly 


Pro 


Val 


Ser 


Leu 


Thr 


Met 


65 








70 










75 










80 


Cys Lys 


Leu 


Thr 


Ser 


Gly Lys 


Tyr 


Pro 


Asn 


Cys 


Arg 


Tyr 


Lys 


Glu 


Lys 








85 










90 










95 


Arg Gin 


Asn 


Lys 


Ser 


Tyr 


Val 


Val 


Ala 


Cys 


Lys 


Pro 


Pro 


Gin 


Lys 


Lys 






100 










105 










110 


Asp Ser 


Gin 


Gin 


Phe 


His 


Leu 


Val 


Pro 


Val 


His 


Leu Asp 


Arg Val 


Leu 




115 










12 0 










125. 









<210> 127 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<400> 127 

Met Pro Leu Leu Thr Leu Tyr Leu Leu Leu Phe Trp Leu Ser Gly Tyr 

1 5 10 15 

Ser He Ala 



<210> 128 
<211> 286 
<212> PRT 

<213> Homo sapiens 

<400> 128 
Thr Gin He Thr Gly Pro Thr Thr 

1 5 
Leu Thr Val Gin Cys Val Tyr Arg 
20 

Trp Trp cys Arg Gly Ala lie Trp 



Val Asn Gly Leu Glu Arg Gly Ser 

10 15 
Ser Gly Trp Glu Thr Tyr Leu Lys 
25 30 
Arg Asp Cys Lys He Leu Val Lys 



75 





35 








40 


Thr Ser 


Gly 


Ser 


Glu Gin 


Glu 


Val 


50 








55 




Asp Asn 


Gin 


Lys Asn Arg Thr Phe 


65 






70 






Lys Thr 


Asp 


Ala Asp Thr Tyr Trp 








85 






Asp Leu 


Gly 


Val 


Thr Val 


Gin Val 






100 








Ala Pro 


Thr 


Thr 


Pro Thr 


Ser 


Thr 




115 








120 


Glu Glu 


Thr 


Ser 


Ser Ser 


Pro 


Thr 


130 








135 




Arg His 


Lys 


Leu 


Leu Lys 


Leu 


Ser 


145 






150 






He Leu 


Leu 


Leu 


Leu Leu 


Val 


Ala 








165 






Met Lys 


Tyr 


Gin Gin Lys Ala 


Ala 






180 








Gin Pro 


Leu 


Glu Gly Asp Leu Cys 




195 








200 


Ala Gly 


Thr 


Ser 


Pro Arg Lys Ala 


210 








215 




Val Asp 


Gin 


Val 


Glu Val 


Glu 


Tyr 


22$ 






230 






Glu Asp 


He 


Ser 


Tyr Ala 


Ser 


Leu 








245 






Pro Thr 


Tyr 


Cys 


Asn Met 


Gly His 






260 








Gly Pro 


Glu 


Glu 


Pro Thr 


Glu 


Tyr 




275 








280 


<210> 


129 








<211> 


150 








<212> 


PRT 








<213> 


Homo sapiens 




<400> 


129 








Thr Gin 


lie 


Thr Gly Pro 


Thr 


Thr 


1 






5 






Leu Thr 


Val 


Gin Cys Val 


Tyr Arg 






20 








Trp Trp 


Cys 


Arg 


Gly Ala 


lie 


Trp 




35 








40 


Thr Ser Gly 


Ser 


Glu Gin 


Glu 


Val 


50 








55 




Asp Asn 


Gin 


Lys 


Asn Arg 


Thr 


Phe 


65 






70 






Lys Thr Asp 


Ala 


Asp Thr 


Tyr 


Trp 








85 






Asp Leu Gly 


Val 


Thr Val 


Gin 


Val 






100 








Ala Pro 


Thr 


Thr 


Pro Thr 


Ser 


Thr 




115 








120 


Glu Glu 


Thr 


Ser 


Ser Ser 


Pro 


Thr 


130 








135 




Arg His Lys 


Leu 


Leu Lys 






145 






150 







Lys Arg Asp Arg Val 


Ser 


He 


Lys 


60 








Thr Val Thr Met Glu 


Asp 


Leu 


Met 


75 






80 


Cys Gly He Glu Lys 


Thr 


Gly Asn 


90 




95 




Thr He Asp Pro Ala 


Ser 


Thr 


Pro 


105 


110 






Thr Phe Thr Ala Pro 


Val 


Thr 


Gin 


125 








Leu Thr Gly His His 


Leu 


Asp 


Asn 


140 








Val Leu Leu Pro Leu 


He 


Phe 


Thr 


155 






160 


Ala Ser Leu Leu Ala 


Trp 


Arg 


Met 


170 




175 




Gly Met Ser Pro Glu 


Gin 


Val 


Leu 


185 


190 






Tyr Ala Asp Leu Thr 


Leu 


Gin 


Leu 


205 








Thr Thr Lys Leu Ser 


Ser 


Ala 


Gin 


220 








Val Thr Met Ala Ser 


Leu 


Pro 


Lys 


235 






240 


Thr Leu Gly Ala Glu 


Asp 


Gin 


Glu 


250 




255 




Leu Ser Ser His Leu 


Pro 


Gly Arg 


265 


270 






Ser Thr He Ser Arg 


Pro 






285 








Val Asn Gly Leu Glu 


Arg 


Gly Ser 


10 




15 




Ser Gly Trp Glu Thr 


Tyr 


Leu Lys 


25 


30 






Arg Asp Cys Lys lie 


Leu 


Val 


Lys 


45 








Lys Arg Asp Arg Val 


Ser 


He 


Lys 


60 








Thr Val Thr Met Glu 


Asp 


Leu 


Met 


75 






80 


Cys Gly He Glu Lys 


Thr 


Gly Asn 


90 




95 




Thr He Asp Pro Ala 


Ser 


Thr 


Pro 


105 


110 






Thr Phe Thr Ala Pro 


Val 


Thr 


Gin 


125 








Leu Thr Gly His His 
i An 


Leu 


Asp 


Asn 



140 



76 
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<210> 130 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 130 

Leu. Ser Val Leu Leu Pro Leu lie Phe Thr lie Leu Leu Leu Leu Leu 

15 10 15 

Val Ala Ala Ser Leu Leu Ala Trp 
20 



<210> 131 
<211> 112 
<212> PRT 

<213> Homo sapiens 





<400> 131 














Arg 


Met 


Met Lys 


Tyr 


Gin 


Gin 


Lys Ala Ala Gly Met Ser Pro 


Glu 


Gin 


1 






5 






10 


15 




val 


Leu 


Gin Pro 


Leu 


Glu 


Gly 


Asp Leu Cys Tyr Ala Asp Leu 


Thr 


Leu 






20 








25 30 






Gin 


Leu 


Ala Gly 


Thr 


Ser 


Pro 


Arg Lys Ala Thr Thr Lys Leu 


Ser 


Ser 






35 








40 45 






Ala 


Gin 


Val Asp 


Gin 


Val 


Glu 


Val Glu Tyr Val Thr Met Ala 


Ser 


Leu 




50 








55 


60 






Pro 


Lys 


Glu Asp 


lie 


Ser 


Tyr 


Ala Ser Leu Thr Leu Gly Ala 


Glu Asp 


65 








70 




75 




80 


Gin 


Glu 


Pro Thr 


Tyr 


Cys 


Asn 


Met Gly His Leu Ser Ser His 


Leu 


Pro 








85 






90 


95 




Gly 


Arg 


Gly Pro 


Glu 


Glu 


Pro 


Thr Glu Tyr Ser Thr lie Ser 


Arg 


Pro 






100 








105 110 








<210> 132 
















<211> 21 
















<212> PRT 















<213> Homo sapiens 
<400> 132 

Met Asp His Cys Gly Ala Leu Phe Leu Cys Leu Cys Leu Leu Thr Leu 

15 10 15 

Gin Asn Ala Thr Thr 
20 

<210> 133 
<211> 507 
<212> PRT 

<213> Homo sapiens 
<400> 133 

Glu Thr Trp Glu Glu Leu Leu Ser Tyr Met Glu Asn Met Gin Val Ser 

15 10 15 

Arg Gly Arg Ser Ser Val Phe Ser Ser Arg Gin Leu His Gin Leu Glu 

20 25 30 

Gin Met Leu Leu Asn Thr Ser Phe Pro Gly Tyr Asn Leu Thr Leu Gin 

35 40 45 

Thr Pro Thr lie Gin Ser Leu Ala Phe Lys Leu Ser Cys Asp Phe Ser 

50 55 60 

Gly Leu Ser Leu Thr Ser Ala Thr Leu Lys Arg Val Pro Gin Ala Gly 
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65 


70 




75 








80 


Gly Gin His 


Ala Arg Gly Gin His Ala Met 


Gin Phe 


Pro 


Ala 


Glu 


Leu 




85 


90 








95 




Thr Arg Asp 


Ala Cys Lys 


Thr Arg Pro Arg 


Glu Leu Arg 


Leu 


He 


Cys 




100 


105 






110 






lie Tyr Phe 


Ser Asn Thr 


His Phe Phe Lys 


Asp Glu Asn 


Asn 


Ser 


Ser 


115 




120 




125 








Leu Leu Asu 


Asn Tyr Val 


Leu Gly Ala Gin 


Leu Ser 


His 


Gly His Val 


130 




135 


140 










Asn Asn Leu 


Arg Asp Pro 


Val Asn He Ser 


Phe Trp 


His 


Asn 


Gin 


Ser 


145 


150 




155 








160 


Leu Glu Gly 


Tyr Thr Leu 


Thr Cys Val Phe 


Trp Lys 


Glu 


Gly Ala 


Ara 




165 


170 








175 




Lys Gin Pro 


Trp Gly Gly Trp ser Pro Glu 


Gly Cys Arg 


Thr 


Glu 


Gin 




180 


185 






190 






Pro Ser His 


Ser Gin Val 


Leu Cys Arg Cys 


Asn His 


Leu 


Thr Tyr 


Phe 


195 




200 




205 








Ala Val Leu 


Met Gin Leu 


Ser Pro Ala Leu 


Val Pro 


Ala 


Glu 


Leu 


Leu 


210 




215 


220 










Ala Pro Leu 


Thr Tyr lie 


Ser Leu Val Gly 


Cys Ser 


He 


Ser 


He 


Val 


225 


230 




235 








240 


Ala Ser Leu 


lie Thr Val 


Leu Leu His Phe 


His Phe 


Arg 


Lys 


Gin 


Ser 




245 


250 








255 




Asp Ser Leu 


Thr Arg He 


His Met Asn Leu 


His Ala 


Ser 


Val 


Leu 


Leu 




260 


265 






270 






Leu Asn lie 


Ala Phe Leu 


Leu Ser Pro Ala 


Phe Ala 


Met 


Ser 


Pro 


Val 


275 




280 




285 








Pro Gly Ser 


Ala Cys Thr 


Ala Leu Ala Ala 


Ala Leu 


His 


Tyr 


Ala 


Leu 


290 




295 


300 










Leu Ser Cys 


Leu Thr Trp 


Met Ala He Glu 


Gly Phe 


Asn 


Leu 


Tyr 


Leu 


305 


310 




315 








320 


Leu Leu Gly 


Arg Val Tyr 


Asn He Tyr He 


Arg Arg 


Tyr 


Val 


Phe 


Lys 




325 


330 








335 




Leu Gly Val 


Leu Gly Trp 


Gly Ala Pro Ala 


Leu Leu 


Val 


Leu 


Leu 


Ser 




340 


345 






350 






Leu Ser Val 


Lys Ser Ser 


Val Tyr Gly Pro 


Cys Thr 


He 


Pro 


Val 


Phe 


355 




360 




3 65 








Asp Ser Trp 


Glu Asn Gly Thr Gly Phe Gin 


Asn Met 


Ser 


He 


Cys 


Trp 


370 




375 


380 










Val Arg Ser 


Pro Val Val 


His Ser Val Leu 


Val Met Gly 


Tyr Gly Gly 


385 


390 




395 








400 


Leu Thr Ser 


Leu Phe Asn 


Leu Val Val Leu 


Ala Trp Ala 


Leu Trp 


Thr 




405 


410 








415 




Leu Arg Arg 


Leu Arg Glu Arg Ala Asp Ala 


Pro Ser 


Val 


Arg Ala Cys 




420 


425 






430 






His Asp Thr 


Val Thr Val Leu Gly Leu Thr 


Val Leu 


Leu 


Gly Thr Thr 


435 




440 




445 








Trp Ala Leu 


Ala Phe Phe 


Ser Phe Gly Val 


Phe Leu 


Leu 


Pro 


Gin 


Leu 


450 




455 


460 










Phe Leu Phe 


Thr He Leu Asn Ser Leu Tyr 


Gly Phe Phe 


Leu 


Phe 


Leu 


465 


470 




475 








480 


Trp Phe Cys 


Ser Gin Arg Cys Arg Ser Glu 


Ala Glu 


Ala 


Lys 


Ala 


Gin 




485 


490 








495 




lie Glu Ala 


Phe Ser Ser 


Ser Gin Thr Thr 


Gin 












500 


505 













<210> 134 
<211> 223 
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<212> PRT 

<213> Homo, sapiens 



<400> 134 



Glu 


Thr 


Trp 


Glu 


Glu 


Leu 


Leu 


Ser 


1 








5 








Arg. 


Gly 


Arg 


Ser 


Ser 


Val 


Phe 


Ser 








20 










Gin 


Met 


Leu 


Leu 


Asn 


Thr 


Ser 


Phe 






35 










40 


Thr 


Pro 


Thr 


lie 


Gin 


Ser 


Leu 


Ala 




50 










55 




Gly 


Leu 


Ser 


Leu 


Thr 


Ser 


Ala 


Thr 


65 










70 






Gly 


Gin 


His 


Ala 


Arg 


Gly 


Gin 


His 










85 








Thr 


Arg 


Asp 


Ala 


Cys 


Lys 


Thr Arg 








100 










lie 


Tyr 


Phe 


Ser 


Asn 


Thr 


His 


Phe 






115 










120 


Leu 


Leu 


Asn 


Asn 


Tyr 


Val 


Leu Gly 




130 










135 




Asn 


Asn 


Leu 


Arg 


Asp 


Pro 


Val 


Asn 


145 










150 






Leu 


Glu 


Gly 


Tyr 


Thr 


Leu 


Thr Cys 










165 








Lys 


Gin 


Pro 


Trp 


Gly 


Gly 


Trp 


Ser 








180 










Pro 


Ser 


His 


Ser 


Gin 


Val 


Leu 


Cys 






195 










200 


Ala 


Val 


Leu 


Met 


Gin 


Leu 


Ser 


Pro 




210 










215 





Tyr Met Glu 


Asn Met Gin Val 


Ser 


10 




15 




Ser Arg Gin 


Leu His Gin 


Leu 


Glu 


25 


30 






Pro Gly Tyr 


Asn Leu Thr 


Leu 


Gin 




45 






Phe Lys Leu 


Ser Cys Asp 


Phe 


Ser 




60 






Leu Lys Arg 


Val Pro Gin 


Ala 


Gly 


75 






80 


Ala Met Gin 


Phe Pro Ala 


Glu 


Leu 


90 




95 




Pro Arg Glu 


Leu Arg Leu 


He 


Cys 


105 


110 






Phe Lys Asp 


Glu Asn Asn 


Ser 


Ser 




125 






Ala Gin Leu 


Ser His Gly His 


Val 




140 






He Ser Phe 


Trp His Asn 


Gin 


Ser 


155 






160 


Val Phe Trp 


Lys Glu Gly Ala 


Arg 


170 




175 




Pro Glu Gly 


Cys Arg Thr Glu 


Gin 


185 


190 






Arg Cys Asn 


His Leu Thr 


Tyr 


Phe 




205 






Ala Leu val 


Pro Ala Glu 


Leu 





220 





<210> 


13 5 




<211> 


25 




<212> 


PRT 




<213> 


Homo sapiens 




<400> 


135 


Leu 


Ala Pro 


Leu Thr Tyr lie Ser Leu Val Gly 


1 




5 10 


Val 


Ala Ser 


Leu He Thr Val Leu Leu 






20 25 




<210> 


136 




<211> 


20 




<212> 


PRT 




<213> 


Homo sapiens 



15 



<400> 136 

Leu His Ala Ser Val Leu Leu Leu Asn He Ala Phe Leu Leu Ser Pro 

1 5 10 15 

Ala Phe Ala Met 
20 

<210> 137 
<211> 21 
<212> PRT 
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<213> 


Homo sapiens 




<400> 


137 


Tyr 


Ala Leu 


Leu Ser Cys Leu Thr Trp Met Ala lie Glu Gly Phe Asn 


1 




5 10 15 


Leu 


Tyr Leu 


Leu Leu 






20 




<210> 


138 




<2 11> 


19 




<212> 


PRT 




<213> 
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1. rn Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 



2. p~| Claims No*.: 

1 — 1 because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Claims Nos.i 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
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BOX U. OBSERVATIONS WHERE UNITY OP INVENTION WAS LACKING 
Thit ISA found multiple inventions a* follows: 

This application contain* the following invention* or groups of inventions which are not to linked as to form a tingle 
inventive concept under PCT Rule 13.1. In order for all invention! to be searched, the appropriate additional search 
fees must be paid. 

Group I, claim(t)MO and 12, in so far as they are drawn to Intercept 340, polynucleotides of SEQ ID NOS: 1 and 3, 
vector* host cell, method of producing a protein recombinantly and protein of SEQ ID NO: 2. 

Groups II-VII, elaim(s) 1-10 and 12, in so far as they are drawn to the next six polynucleotides of distinct cDNA clones 
and encoded proteins, identified as Mango 003, Mango 347, Tango 272, Tango 295, Tango 354 and Tango 378, as 
listed in Tables 1 and 2. 

Groups VHI-XIV, claim(s) 11 and 15, in so far as they are drawn to antibodies to one of the seven proteins listed 
above. 

Groups XV-XXI, claims 13, 14, 19, 20 and 22, in so far as they sre drawn to a method for detecting the presence of in 
a sample or identifying a compound which binds to or modulates the activity of a polypeptide of one of the seven 
proteins listed above. 

Groups XXH-XXVH, claims 16 and 17, in so far as they are drawn to a method for detecting the nucleic acids of one 
of the seven cDN A clones listed above. 

Groups XXJXOCXXV, claim 18, in so far as it is drawn to a kit comprising a compound of unspecified constitution 
which selectively binds to a nucleic acid molecule of the seven cDN A clones listed above. 

Groups XXXVI-XL1I, claim 21, in so far as it it drawn to a method for modulating the activity of one of the seven 
proteins listed above. 



The inveuuo*** listed as Groups I-XLtt do not relate to a single inventive concept under PCT Rule 13.1 becaute, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: Group I 
corresponds to the first invention wherein the first product it the polynucleotide and the first method of using is the 
method of making the protein. Note that there is no method of making the polynucleotide. The invention also includes 
the protein made. Each of groups H-VII does not share the same or corresponding special technical feature because 
each group is drawn to a different polynucleotide and encoded protein, and each of groupt VIII-XLH does not share the 
same or corresponding special technical feature because each group is drawn to different compounds or methods of 
using the seven polynucleotides and encoded proteint. Thit Authority therefore considers that the aeveral inventions do 
not share a special technical feature within the meaning of PCT Rule 13.2 and thus do not relate to a single general 
inventive concept within the meaning of PCT Rule 13. 1 . 
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